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Preface 


Useful techniques for the design of multivariable feedback systems have been 
known for at least fifteen years, yet these techniques have remained known 
to only a relatively small part of the community of control engineers. 
This book has been written in order to spread familiarity with the techniques 
more widely. 

My objective has been to enable feedback engineers to design real 
systems, and the choice of material has been constantly, and I hope con- 
sistently, guided by that objective. The reader will therefore find that a 
complete theoretical undetstanding of the techniques discussed will some- 
times require the consultation of other text-books or research journals, but I 
believe that I have given enough detail of techniques and algorithms to allow 
complete analyses and designs to be executed. However, this is a genuine text- 
book and not just a cook-book, so most of the theory required to understand 
each technique has been included. There is also enough theoretical devel- 
opment to allow the reader to go on to read the research literature relatively 
easily. 

The basic view espoused is that sensible design of feedback systems is 
possible only if a frequency-domain point of view is adopted. Between about 
1965 and 1980, the advantages of frequency-domain approaches were 
championed by Professors Rosenbrock and MacFarlane in the UK, 
and by Professor Horowitz in Israel, at a time when most of the academic 
community - particularly in the USA—régarded the frequency domain as 
obsolete and inherently unsuitable for the solution of multivariable problems. 
The instinct of those who kept faith with the frequency domain has been 
spectacularly vindicated during the past ten years. Not only have the rather 
ad hoc techniques of the ‘British school’ proved to be useful in practical design 
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(Chapter 4), but also the central bastion of time-domain approaches, namely 
‘LQG’ optimal control theory, has been turned into an easily usable tech- 
nique which yields sensible designs, by giving it a frequency-domain interpre- 
tation (Chapter 5). And the very latest technique, H ,, optimal control theory, 
has arisen entirely as a result of frequency-domain thinking (Chapter 6). 

Despite this shift towards the frequency domain, most of the ‘time- 
domain’ content of linear systems theory -state feedback, observers, minimal 
realizations, LQG controllers and so on -remains essential for multivariable 
design. Although it is necessary to think and analyse in frequency-domain 
terms, most computational algorithms, and many proofs of theorems, 
are based on state-space methods (Chapters 6 and 8). 

I have omitted some material, such as multivariable root-loci and 
pole-placement methods, because it does not fit easily into my view of multi- 
variable feedback design, apparently being more concerned with the shaping 
of transient responses than with obtaining the potential benefits of feedback. 
On the whole I have made such omissions without misgivings, but there are 
two topics which do not appear in the book and which I feel a little 
apprehensive about leaving out. One is the ‘graph topology’ introduced by 
Vidyasagar, which may well turn out to play a fundamental role in feedback 
theory; the other is the idea of ‘internal-model control’ introduced by Morari, 
which has had some impact on the process-control industries, and which is 
closely related to the Youla parametrization introduced in Chapter 6. But 
anyone wishing to learn about these topics should have no difficulty in 
reading the relevant literature, after reading this book, and internal-model 
control appears in one of the exercises. 

I have also omitted material on discrete-time systems, because of lack 
of space. Almost everything in Chapters 1-4 and 7, and much of Chapter 8, 
holds for discrete-time systems defined by z-transform transfer functions or 
by state-space models, with some obvious modifications. The material on 
LQG design, in Chapter 5, can be given a nearly parallel development for 
discrete-time systems, but the details are considerably different. In principle, 
all the material on H„ design in Chapter 6 is applicable to discrete-time 
systems, since the bilinear transformation z=1+s/1—s can be used to 
transform discrete-time problems into continuous-time ones. But an explicit 
development of H „ theory for discrete-time systems is not yet available. 

Some feedback and control specialists hold the view that the teaching 
of feedback design should be radically revised in the light of recent advances 
in feedback theory. They would begin, for example, with the Youla para- 
metrization of all stabilizing feedback controllers, which is indeed a logical 
starting point if the subject is viewed as a branch of mathematics. In contrast 
with this view, I have adopted a more conservative sequence of presentation, 
and have begun with a review of the ‘classical’ techniques developed for the 
analysis and design of single-input, single-output feedback loops. There is a 
pedagogical advantage in this since the field of multivariable feedback 
systems can be entered painlessly by extending the ‘classical’ ideas. But there 
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is a deeper reason for adopting this approach: the ‘classical’ treatment gives 
appropriate emphasis to questions which are of real significance to engineer- 
ing design, but which may not appear important in a purely mathematical 
account of the subject. For example, there are several results in linear system 
theory which rely on the non-existence of plant zeros in the right half-plane 
(of the complex plane). An acquaintance with the classical theory makes one 
suspect immediately that the location and not the existence of such zeros 
should be significant, particularly as the non-existence of such zeros cannot 
be confirmed by any experimental means. 

In order to facilitate comparisons between alternative design methods, 
each major technique presented in this book is illustrated by being applied to 
the same system —a linearized model of an aircraft, defined in the Appendix — 
and the design specification is similar in each case. 


Readership 


This book is aimed at graduate students who have taken at least one el- 
ementary course on feedback and control systems, and who have some 
acquaintance with linear systems theory. It is possible, although difficult, to 
take a course in linear systems theory at the same time as a course based on 
this book. A more detailed discussion of prerequisites is given in Chapter 1. 
The book should also be accessible to practising feedback engineers who are 
familiar with classical servo design and have had some exposure to ‘modern’ 
(that is, post-1960) control theory. 

Most of the material has been classroom-tested in a number of 
graduate courses given at Imperial College, Cambridge University and, 
especially, the University of California at Santa Barbara, where in 1986 I gave 
a 40-hour course covering much of Chapters 1 to 5 and 7. There are a number 
of difficulties associated with teaching this material. The first is that it is 
essential for both instructor and students to have access to suitable software 
which can be run interactively. All the examples and exercises in the book 
have been solved using PC-Matlab (or Pro-Matlab), together with its associ- 
ated Control System Toolbox and Multivariable Frequency Domain Toolbox 
(and, if I were solving them again, I would use the Robust Control Toolbox for 
Chapters 5 and 6). The second difficulty to be aware of is that the design 
exercises, such as Exercises 4.1 and 4.2, take much longer to solve than 
traditional homework exercises, particularly when students are familiarizing 
themselves with new software or if there is a shortage of computer resources. 
A student cannot be expected to solve more than one, or perhaps at most two, 
such exercises per week (assuming that the course is given at a rate of 4 hours 
per week, say, and that the students are taking a typical mix of courses). It is 
also beneficial for students to solve these exercises working in pairs. The 
temptation to set lots of traditional paper-and-pencil problems, and avoid the 
design exercises, should be resisted, however. The third difficulty is that of 
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examining the students on the material. It seems entirely unsatisfactory to 
emphasize design during the course, and set computer-based design exercises 
for homework, but then examine the students on points of theory and grossly 
simplified examples in order to comply with the requirements of the tra- 
ditional three-hour examination paper. The solution I have successfully 
adopted, both at Cambridge and at Santa Barbara, is to set the class a 
realistic design exercise (possibly varying the details slightly for each student) 
and ask them to hand in solutions by some fixed time (typically 48 hours 
later). 
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re See 


1.1 Overview and prerequisites 


In this book it is assumed that the reader has taken a typical first course in the 
design of feedback systems. Such a course usually covers the Nyquist stability 
criterion, the use of Bode and root-locus plots, and the use of simple 
compensators to achieve reasonable stability margins, steady-state per- 
formance and transient response; we shall review the content of such a course 
very briefly (Section 1.2). We shall go on to discuss single-ioop feedback 
design in more depth than is usual in first courses, and in the process of doing 
so we shall establish some nomenclature and relationships which will con- 
tinue to hold when we come to examine multivariable problems in later 
chapters. 

Chapters 2 and 3 extend the concepts and results of Chapter 1 to 
multivariable systems. Chapter 2 is concerned with establishing stability 
criteria which generalize the classical Nyquist criterion. In order to do this it 
is necessary to define poles and zeros of multivariable systems, and to tangle 
with some linear systems theory; most of the results obtained also find 
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application in later chapters. Chapter 3 deals with the analysis of per- 
formance and stability margins of multivariable systems. In addition to 
straightforward extensions of the results of Chapter 1, Chapter 3 presents a 
considerable amount of recent material on analysing the robustness of 
feedback systems in the face of specific disturbances or parameter variations 
which goes beyond the classical notions of gain and phase margins. 

Chapters 4 to 7 are concerned with design techniques for multi- 
variable feedback systems. Chapter 4 deals mostly with direct extensions of 
classical methods to multivariable systems. These methods have become 
known as the ‘British school’ of multivariable design, and provide the 
simplest and most easily comprehensible design techniques. Chapter 5 
describes the use of ‘linear quadratic Gaussian’ control theory in such a 
way that sensible feedback designs are obtained. This involves analysing 
the resulting designs in the frequency-domain terms developed in Chapters 2 
and 3. Chapter 6 moves to the very new area of H „ optimal control. This 
approach is of great current interest, and has provided some new fundamen- 
tal results about feedback systems, as well as a very powerful design techni- 
que. Chapter 7 describes some methods of design by parameter optimization; 
these are relatively ‘brute-force’ approaches, but sometimes they have to be 
resorted to when other techniques either fail or are inapplicable, and they can 
produce excellent designs. 

Finally, Chapter 8 discusses the software which is needed to do any 
analysis or design in the realm of multivariable systems. 

The logical interdependence of the chapters can be represented as in 
the diagram below. Thus the reader interested mainly in practical design 
techniques may concentrate initially on Chapters 1, 2, 3, 4 and 7, while a 
research student may prefer Chapters 1, 2, 3 and 6. All readers are urged to 
look at Chapter 8, even if they skip most of the details of Sections 8.2 and 8.3. 


Since the basic representation of a multivariable system which we use 
is the transfer-function matrix -that is, a matrix whose elements are transfer 
functions—some familiarity with linear algebra is essential to the reader 
wishing to progress beyond Chapter 1. At the very least, he should be familiar 
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with basic matrix manipulation and with eigenvalues and eigenvectors. If he 
is also familiar with the singular-value decomposition, the rank of a matrix, 
and notions such as column span or invariant subspace, he will be quite 
adequately prepared for the whole of the book. 

A little knowledge of linear systems theory is also assumed in most 
chapters: the reader should know what is meant by the terms state-space 
model, controllable, observable, minimal realization and state feedback. Pre- 
vious exposure to the theory of linear quadratic optimal control and/or 
Kalman filtering would also be helpful for Chapters 5 and 6, but is not 
essential. Readers without this background should consult Åström and 
Wittenmark (1984) or Franklin and Powell (1980), or (for a thorough 
treatment) Kailath (1980) or Chen (1984). Chapters 3 and 5 make some use of 
spectral densities to describe stochastic processes; spectral densities also 
make occasional appearances in examples and discussions in other chapters. 


1.2 Review of elementary feedback design 


We shail confine our attention to linear, time-invariant systems whose 
input/output behaviour is governed by a set of ordinary linear differential 
equations with constant coefficients. A simple example of such a system is 


P(t) +a, H(t) +a, y(t) = b u(t)+b u(t) (1.1) 


in which u(t) represents an input signal and y(t) an output signal. If we apply 
the Laplace transform to (1.1) we obtain 


(s* +a,8 +4) ¥(s) = (bjs +b )U(s)+(s+a,)y(0) +a, $(0)—b, u(0) 
(1.2) 
and if we set y(0)=(0)=u(0)=0 we obtain the transfer function 


¥(s)  bis+b, 
U(s) s*+a,s+a, 


(1.3) 


We shall usually represent systems by transfer functions such as these. 
Typically, we shall use a notation such as G(s) to denote a transfer function, 
and we shall frequently abbreviate this to G. 

If the transfer function G(s) is rational (that is, a ratio of polynomials), 
as in (1.3), then its denominator polynomial is usually the characteristic 
polynomial of the original differential equation, and its roots (the character- 
istic roots) therefore determine the dynamic characteristics of the system—in 
particular, their location in the complex plane determines the stability of the 
system. We say ‘usually’ because it is possible for cancellations to occur 
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Figure 1.1 Negative-feedback connection of two linear systems. 


between the numerator and denominator, so concealing some of the system’s 
dynamics. A full elucidation of the causes and effects of such cancellations can 
be given by using state-space representations of systems rather than transfer 
functions; ‘hidden dynamics’ then correspond to uncontrollable or unobserv- 
able modes (Kailath, 1980). However, we do not wish to pursue this here, and 
shall refer rather vaguely to ‘hidden modes’ when necessary. 

The roots of the numerator polynomial are called the zeros of the 
transfer function, and the roots of the denominator are called its poles. Thus 
the stability of a system is usually determined by its poles. 

Two systems, with transfer functions G(s) and H(s), respectively, may 
be connected together in the negative-feedback configuration shown in 
Figure 1.1. Then, subject to certain conditions (for example, if the signals 
are voltages then input impedances must be much greater than output 
impedances), the transfer function of the closed-loop system is given by 


Y(s) G(s) 
R(s) 1+H(s)G(s) aa 


The stability of this system is determined by the roots of the closed-loop 
characteristic equation 


1+H(s)G(s)=0 (1.5) 


(or, equivalently, the closed-loop poles) provided no hidden unstable modes 
have been introduced by the connection of the two systems — that is, provided 
no cancellation of right half-plane poles and zeros has occurred when 
forming the product HG. 

Now suppose that the designer of the feedback system has the freedom 
to change G or H. For any particular design he can easily check where the 
solutions of (1.5) lie. But if some of them are in the right half-plane, or so close 
to the imaginary axis as to give unacceptably resonant behaviour, then it is 
almost impossible to see from (1.5) how G or H should be changed for the 
better. At this point salvation is provided by the Nyquist stability theorem: 
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Figure 1.2 Nyquist locus of closed-loop stable system (solid curve) and unstable 
system (dashed curve); it is assumed that the open-loop system is stable. 


Theorem 1.1: The closed-loop system is stable if and only if the 
graph of H(jw)G(ja), for — œ <w < œ, encircles the point — 1 +j0 as 
many times anticlockwise as H(s)G(s) has right half-plane poles 
(provided there are no hidden unstable modes). 


If the stability condition of this theorem is violated or nearly violated, it is 
relatively easy to see the changes which must be made to G or H. 

Most commonly HG has no unstable poles, in which case the graph of 
H(jæ)G(jæ), or Nyquist locus, of a closed-loop stable system is shown by the 
solid line in Figure 1.2, while that of an unstable system is shown by the 
dashed line; in each case only half the locus (for 0<w< œ) is shown. These 
can also be displayed on a Bode plot, as in Figure, 1.3. For the stable system 
the phase margin ¢ is shown in Figures 1.2 and 1.3, and the gain margin y is 
shown in Figure 1.3. 

If the closed-loop system is unstable, or if these stability margins are 
judged to be insufficient, then one or more compensators can be inserted into 
the feedback loop. One of the simplest possible compensators is a system 
whose transfer function is 


_ a(1+sT) 


The Bode plot of this transfer function is shown in Figure 1.4, for a<1. If 
a<1 the compensator is said to provide phase lead (or phase advance), since 
arg K(jw)>0 for all @>0 in this case, while if a> 1 it is said to provide phase 
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log |HG| 


Figure 1.3 Bode plot form of Figure 1.2. 


log |K| 


loga 1/T T/a  1Ta 


1/T 1/T\/a 1/Ta 


a(l+sT) . 
Figure 1.4 Bode plot of ———_~ with «<1. 
1+saT 
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Figure 1.5 Typical Bode plots of system after compensation by phase lead (solid 
curves) or phase lag (broken curves); curves marked 2 are the same as those marked 
1, but with adjustment of gain to achieve specified phase margin ¢. 


lag since arg K(jw)<0. The greatest phase change (i.e. value of arg K(jo)) 
occurs at the frequency w,,=1/T. /a, which is midway between the corner or 
break frequencies œ = 1/T and w=1/Tx on a Bode plot, since the frequency- 
axis calibration is logarithmic, and the value of this maximum phase change 


arg K(i) = tan] (2a) | (1.7) 


Figure 1.5 shows typical Bode plots of H(jw)G(j@) K (jo) when K is a 
phase-lead (solid curves) or a phase-lag (dashed curves) compensator; it also 
shows them after a further gain adjustment (that is, insertion of a constant 
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Figure 1.6 M-circles. 


gain into the feedback loop) to achieve a specified phase margin ¢. Although 
given stability margins can usually be achieved by following either a phase- 
lead or a phase-lag strategy, it is seen from Figure 1.5 that the phase-lead 
strategy increases both the cross-over frequency, namely the value of w for 
which | H(jw)G(j@)K(jw)|=1, and also the gain at high frequencies. This 
leads to faster transient responses being achieved with the phase-lead strat- 
egy, but at the expense of higher sensitivity to noise and larger signal levels 
appearing in the loop. In practice, compensators are frequently more compli- 
cated than the one defined by (1.6); they may exhibit both phase-lead and 
phase-lag characteristics (over disjoint frequency ranges), and they may have 
both real and complex poles and zeros. 

An alternative way of specifying stability margins is to require the 
Nyquist locus to remain outside some neighbourhood of the point — 1 + j0. 
Such neighbourhoods are usually defined by M-circles. These are the loci of 
points z in the complex plane, for which 


=M (1.8) 


Z 
1+z 


where M is some positive real number. Some M-circles are shown in 
Figure 1.6; it can be seen that all those M-circles for which M > 1 enclose the 
point — 1 +j0, and that the circles become smaller as M becomes larger. To 
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Figure 1.7 Nichols chart, showing a Nyquist locus (solid curve) and some M- 
circles (broken curves). 


see why M-circles give useful stability margin specifications, suppose that, in 
Figure 1.1, we have H(s)=1, and that G(s) already includes any compen- 
sators. Then the closed-loop gain from r to y, at frequency œ, is given by 


G(jo) 
14+G(jo) co 


So if the Nyquist locus of G(s) is allowed to penetrate a high-valued M-circle 
over some range of frequencies, then the closed-loop frequency response will 
exhibit a large peak at these frequencies. This in turn indicates the presence of 
resonance in the closed-loop system, which is usually undesirable for prac- 
tical reasons (rapid deterioration of components, heavy power consumption), 
and which roust be caused by the presence of some lightly damped closed- 
loop poles close to the stability boundary. 

Specifying that the Nyquist locus should remain outside the M =,/2 
M-circle, for example, therefore imposes some minimum degree of damping 
on the closed-loop poles (with the proviso, as usual, that the Nyquist locus 
cannot convey information about poles which are masked, or nearly masked, 
by zeros). 

It is often useful to display a Nyquist locus on a Nichols chart, namely 
with log|G(j@)| plotted against arg G(jm). Figure 1.7 shows an example, 


with some M-circles superimposed. The M-circles are now distorted into 
non-circular shapes. 
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The approach to feedback design which has been reviewed here will 
serve as the basis for the multivariable techniques to be described later. Root- 
locus and pole-placement techniques have not been reviewed, because they 
will not be needed later in this book. 


1.3 A standard problem 


Figure 1.8 shows the form in which we shall pose the feedback design 
problem. The ‘plant’, which is assumed to be given and unalterable, is 
represented by the transfer function G(s) and the (transform of the) disturb- 
ance signal d(s), its input and output signals being u(s) and y(s), respectively. 
The function r(s) represents a ‘reference’ or ‘command’ signal, which is to be 
followed by the plant output y(s), and m(s) represents measurement errors. 
K(s) and P(s) represent dynamic systems which are to be designed — we shall 
refer to these as the ‘feedback compensator’ and the ‘pre-filter’, respectively. 

At first sight, most feedback design problems do not appear to have 
the structure of Figure 1.8. Disturbance signals often appear at the input of 
the plant, rather than (or in addition to) its output. For'example, if the plant 
consists of an inertial load such as an aerial dish, and the input signal u(s) 
represents the driving torque, then the major disturbance may appear as a 
random component of this torque. Also, transfer functions may appear in the 
feedback path, either because that turns out to be the only practical place to 
insert compensation in a particular case, or because the sensing device used 
to measure the plant output itself has significant dynamics. For example, if 
y(s) represents the angular velocity about one axis of an artificial satellite, 
this may be measured by a rate gyro which (together with its signal- 
conditioning circuitry) has second-order dynamics. A further way in which 
the structure of a real problem may differ from that of Figure 1.8 is that there 
may be more than one ‘loop’ to design (for example, a fast velocity loop inside 


d(s) 


Controller Plant 


m(s) 


Figure 1.8 Standard feedback configuration. 
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n(s) 


m(s) 


Figure 1.9 Another possible feedback configuration, often equivalent to that 
shown in Figure 1.8. 


a slower position loop) or the compensator to be designed may be distributed 
around the system in a more complicated way than is shown in Figure 1.8. 

However, since we have assumed the system to be linear, it is usually 
possible to obtain (by use of ‘block-diagram algebra’) a representation of the 
problem which does have the structure of Figure 1.8. For example, the system 
shown in Figure 1.9 is equivalent to that of Figure 1.8 if 


P(s)=Q"*(s)R(s) 


K(s)=C(s)Q(s) 
and 


d(s)=G(s)n(s) 


In this case the feedback design problem is not quite the same as it 
would have been had we started with Figure 1.8, because K (s) and P(s) are 
now constrained to some extent. Whether these constraints alter the problem 
significantly will depend on the details of Q(s). So far, we have not assumed 
that the signals appearing in Figures 1.8 and 1.9 are single variables. They 
may be vectors of variables, in which case the transfer functions will be 
matrices. Now, if the measurement device represented by Q(s) has unequal 
numbers of inputs and outputs, then Q(s) will not be a square matrix, and 
Q~*(s) will not exist. In such a case Figures 1.8 and 1.9 are not equivalent, 
unless one is dealing with a ‘regulator’ problem, for which r(s)=0. 

Our justification for using the structure of Figure 1.8 is that either a 
given problem can be recast in that structure or, if it cannot, then the general 
results which we shall develop will still hold, although some details are likely 
to change. 

A multi-loop or multi-compensator problem can usually be put into 
the form of Figure 1.8 by allowing the transfer functions to be matrices, and 
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m(s) 


Figure 1.10 Feedback configuration with feedforward control of disturbances. 


the signals to be vectors. However, it is better to decompose the problem into 
a sequence of single-loop problems if possible (typically, designing fast inner 
loops first and slow outer loops later) since a sequence of single-loop 
problems is usually much easier to solve than a single multivariable problem. 

We assume that the measurement error m is unknown, since it could 
be corrected for if it were known. However, it may be permissible to assume 
that it can to some extent be described. Most frequently it is described as a 
stochastic process, with a known power spectral density (most commonly 
flat). Another common approach is to assume the measurement error to have 
a constant but unknown value -this describes systematic bias in the measure- 
ment. However, such bias.can be corrected only if more than one independent 
measurement of the output is available, which is not the case for the 
configuration shown in Figure 1.8. We therefore assume that any systematic 
bias in the measurement has already been corrected. 

The disturbance d is also assumed to be unknown. In practice, it 
frequently happens that one can measure or estimate part or all of a 
disturbance. For example, if it is caused by a change in the composition of 
some raw material being fed into a process, that change may be detected and 
measured before the material even enters the process. Suppose that 


d(s)=d,(s)+4d,(s) 


and that d, can be measured. Then the controller should include a ‘feed- 
forward’ compensator, as shown by the block F (s) in Figure 1.10. If 
F(s)= —G~‘(s), then the disturbance d, is completely cancelled by the feed- 
forward compensator, and Figure 1.10 is again equivalent to Figure 1.8, if we 
take d(s)=d,(s). For several reasons, it is impossible to set F(s)= — G7 1(s) 
exactly. Perhaps the most fundamental is that usually G~+(s) is not the 
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transfer function of any realizable system. Also, G(s) is usually known only 
approximately (this is often the reason for using feedback in the first place), 
and even if F(s) only approximates —G~1(s), this may require impossibly 
large and rapid changes of the control signal u(s) in response to certain 
disturbance signals. Finally, if the plant is either unstable, or has zeros in the 
right half-plane, then series compensation by G~!(s) would result in unstable 
modes which were either uncontrollable or unobservable (Kailath, 1980), and 
which therefore could not be stabilized. In spite of these limitations, it clearly 
makes sense to use any available information about disturbances in this 
manner, and doing so often makes the feedback design task dramatically 
easier. Therefore, when we start with Figure 1.8 we assume that any detailed 
information about disturbances has already been exploited in a feedforward 
scheme, and that d(s) in Figure 1.8 represents the effective disturbance which 
remains, and which is therefore unknown. 

It should be added that the ‘disturbance’ d can represent various 
sources of uncertainty in the plant. For example, suppose that we believe the 
true transfer function of the plant to be G+A rather than G, but we do not 
know A -we may have only a bound on the magnitude of A, for example. This 
can again be represented by Figure 1.8 if we take d(s)= A(s)u(s) (which is of 
course unknown, even though u(s) is known). 


1.4 Fundamental relations 


From Figure 1.8 we see that 
y(s)=d(s) + G(s) K(s) [P(s)r(s)—m(s)—y(s)] (1.10) 
Hence, 
LI + G(s)K(s)]y(s)=d(s)+ G(s)K(s)[P(s)r(s)—m(s)] (1.11) 


where J is the unit matrix (we have yet to restrict the discussion to single-loop 
systems). If we define the return difference 


F,(s)=I+G(s)K(s) (1.12) 


(the subscript denoting that the return difference is evaluated at the output of 
the plant), and the sensitivity function 


S(s)= Fy '(s) (1.13) 
then we can write 


y(s)=S(s)d(s)+G,(s)r(s)—T(s)m(s) (1.14) 
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where 

T(s)=S(s)G(s)K(s) (1.15) 
and 

G,(s) = T(s)P(s) (1.16) 


We shall call T(s) the closed-loop transfer function. 

It should be noted that S(s) and 7(s) depend only on those transfer 
functions which are inside the feedback loop, while the pre-filter P(s) affects 
only G,(s). Consequently, we can consider first the problem of designing K(s) 
to obtain desired S(s) and T(s), and subsequently design P(s) to give a 
suitable G,(s). This second step is a matter of open-loop series compensation, 
and so is not strictly a feedback design problem. In many cases the ideal 
solution would be to set P(s) = T7 ‘(s) (because of equation (1.16)), but we 
cannot do this because of the same obstacles that prevent perfect feedforward 
control from being realized. 

In many (perhaps most) applications the pre-filter is fixed, most 
commonly as the unit matrix. In this case G,(s) and T (s) cannot be adjusted 
independently of each other, and in the most common case are identical. We 
then have a ‘one degree of freedom’ design problem, whereas the general 
problem represented by Figure 1.8 is known as the ‘two degrees of freedom’ 
problem. We shall consider the ‘one degree of freedom’ problem, namely the 
design of K(s) only, almost exclusively. (In many cases a pre-filter is effec- 
tively supplied by a plant operator, or a pilot, who learns to compensate for 
T(s) by giving the reference signal a different profile from the one he wishes 
the output to follow.) 

From equation (1.14) we see that, if the feedback loop is to be 
successful in attenuating the effects of disturbances acting on the plant, and if 
the output is to be insensitive to measurement errors, then we need both S(s) 
and 7(s) to be small in some sense. However, from (1.12) and (1.15) we obtain 


T(s)+S(s)=I (1.17) 


This shows that, if we succeed in making S(s) nearly zero, then T(s) will be 
nearly the identity; and conversely, if we make T(s) nearly zero, then S(s) will 
be nearly the identity. We thus have an unavoidable trade-off between 
attenuating disturbances and filtering out measurement error. This trade-off 
is one of the factors which make feedback design difficult. 

The closed-loop transfer function T(s) is often referred to as the 
complementary sensitivity, because of relation (1.17). This is useful for distin- 
guishing T(s) from other possible closed-loop transfer functions. 

In process control, signals from measurement sensors are often effec- 
tively free from error because the time constants which occur are large 
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enough to allow almost complete elimination of electrical noise by filtering. 
(However, measurements may be noisy for other than electrical reasons 
—consider measuring the level of a boiling liquid, for example.) It may appear 
that the trade-off is avoided in such a situation. Unfortunately this is not 
so, because in process control it is invariably a requirement that control 
signal movements should be kept reasonably small, and it turns out that this 
is equivalent to the requirement that T(s) should be kept small (at least in the 
single-loop case), as we shall now show. 
From Figure 1.8 we see that 


u(s)= K(s)[P(s)r(s)—m(s)—d(s)— G(s)u(s) ] (1.18) 
Hence 

[I+K(s)G(s)]u(s)=K(s)[P(s)r(s)—m(s)—d(s)] (1.19) 

If we define the return difference at the input of the plant to be 

F,(s)=1+K(s)G(s) (1.20) 


then we can write | 
u(s) = F; *(s)K(s)[P(s)r(s)—m(s)—d(s)] (1.21) 


(F;+(s) is sometimes called the input sensitivity, in which case S(s)=F > !(s) 
is called the output sensitivity.) 
For the rest of this chapter we shall confine ourselves to single-loop 


feedback. In this case the two return differences are the same, and we can 
define 


F(s) = F,(s)=F,(s) (1.22) 


Let us measure the strength of variation of the control signal by 
fo u?(t)dt. Then, by Parseval’s theorem (Oppenheim et al., 1983), we have 


i u*(t)dt = =| | u(jo) |? dw (1.23) 


0 =% 


where we have used u(t) to denote the input signal as a function of time, and 
u(s) to denote its Laplace transform. In writing equation (1.23) we have 
assumed that the feedback loop is stable, and that the control signal is 
responding to a temporary disturbance- otherwise neither integral would 
exist. 

If a wide class of disturbances is likely to occur, or if the disturbance is 
well defined but its energy spectrum extends over a wide frequency range, 
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then we can see from equation (1.21) that the only way of ensuring that 
f lu(jæ)|? dw is small is to keep F~1(jw)K(ja) small. But 


ssj AA K(jæ) 
F (I) KO) = TE anyon) (1.24) 
_ T(jo) 
6G) (1.25) 


So we see that | 7( jo)| must be kept small, except possibly at those fre- 
quencies at which | G(ja)| is sufficiently large, if any such frequencies exist. 


1.5 The ‘shape’ of the solution 


In practice, the conflict between keeping 7(s) small and keeping S(s) small is 
resolved by making one small at some frequencies, and the other small at 
other frequencies. Usually the spectra of reference signals and disturbances 
are concentrated at low frequencies, while the spectrum of measurement 
errors extends over a much wider frequency range. The solution which is 
almost universally adopted is therefore to make |S(jæ)| small at low fre- 
quencies (for O0<@<@po, say) and | T(jæ)| small at high frequencies (for 
@> a@,, say). Even if no measurement noise is present, equation (1.25) shows 
that | T(j@)| should be made as small as possible at frequencies above those 
at which disturbances (and reference signals, if P(s)=1) occur. 

From equation (1.12) we see that, to make S(s) small, we must make 
|1+ G(s)K(s)| large, and hence | G(s)K(s)| must be large. If | G(j@)| is not 
itself sufficiently large at low frequencies, then | K(j@)| must be made large 
there. We also see that, to make 7(s) small, and hence S(s) close to unity (see 
equation (1.17)), we need to make | G(jw)K(j@)| close to zero. Now, all real 
plants have the characteristic that | G(jw)|—0 as w— œ, so it may be enough 
to require | K(j@)| not to increase at high frequencies, but if the rate of gain 
reduction with frequency of the plant is not fast enough, then | K(ja)| is 
required to be small at high frequencies. 


EXAMPLE 1.1 


A regulator (r=0) is subject to an output disturbance which consists of a 
sine wave of amplitude A and frequency wọ, superimposed on occasional 
steps (that is, a piecewise-constant function) of unpredictable magnitude. 

Its output is to remain within the bounds + 4/10 (ignoring any 
transients from the occasional steps), and this is to be done with minimal 
use of control energy. 
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Figure 1.11 Compensator gain characteristic to cope with single-frequency 
disturbance. 


We can meet this requirement by ensuring that 


(1) S(0)=0 (by the final value theorem for Laplace transforms): 
|K(Q)|=00, if |G(0)|< œ 
(2) | S(j@o) |< 1/10: 
| K(j@o) G(jao)+1|> 10 
or | K(j@o) G(ja_)| = 10 (approximately) 


(3) |K(j@)| is as small as possible, consistent with 1 and 2; thus 
| K(j@)| has the ‘shape’ shown in Figure 1.11. 


Usually, the output disturbance is not concentrated at one frequency, 
but is known to have significant content up to some frequency wọ. To get 
acceptable performance, we then require that 

|S(ja)|<e, 0<HM<wo 
which implies that 


| K(j@)G(j@)|> 1/e (approximately), O<w<ap5 


frequently with |K(0)G(0)|=oo as an additional constraint; Figure 1.12 
shows a typical plot of the ‘open-loop gain’ | K(jw@)G(ja@)|. Recall that 


T(s)=1—S(s) 
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Figure 1.12 Typical specification and characteristic of compensator gain. 


so corresponding to the above, we have 

| T(j@)|=|1—S(jo)|~1, for 0<m<a, ife«1 
and 

| T(0)|=1 


Figure 1.13 shows the typical ‘shapes’ of | S(j@)| and | T(j@)| which 
result. 


0 wo WO, 


Figure 1.13 Typical gain characteristics of sensitivity and complementary 
sensitivity. 
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$ 


Figure 1.14 Closed-loop response to step disturbance on output. 


We call œ, the bandwidth of the feedback loop and define it, rather 
arbitrarily, as the lowest frequency such that 


| T(jo,)| = | T0)|/,/2 


Note that œ, may be considerably higher than w,. Clearly the requirement 
that | T(j@)| should fall as quickly as possible for œw >w, translates into the 
requirement that œ, should be as small as possible. 

Now, it is known that (roughly speaking) the bandwidth of the loop 
is inversely proportional to the response time of y(t), in response to a 
step disturbance—see Figure 1.14. So keeping the bandwidth small implies 
keeping its response relatively ‘sluggish’. Note that if P(s)=1, the response to 
a step on the disturbance is a ‘mirror image’ of the response to a step on the 
reference input, so the ‘tracking’ step response is also relatively slow. 

The loop bandwidth œ, is usually very close to the ‘0 dB cross-over’ 
frequency œ., at which |G(j@,)K(j@,)| = 1. Figure 1.15 shows a Nyquist 
locus. At the cross-over frequency œ, the locus intersects the origin-centred 
unit circle, while at the loop bandwidth frequency œ, it intersects the 
M=1/,/2 circle. If the locus were to lie entirely along the negative imaginary 
axis, these two intersections would occur at the same frequency, and hence we 
would have œ, =w.. If the locus lay entirely along the negative real axis, then 
the rate of change of gain with frequency would be quadratic (40 dB/decade), 
as can be seen by considering the example of two integrators in series. Since 
the M=1/,/2 circle crosses the negative real axis at — 0.414, the loop 
bandwidth would in this case be given by w,=1.55w,. In practice Nyquist 
loci usually lie between these two extremes in the region shown in Figure 1.15. 
As we shall see in Section 1.7.1, a limit on the rate of change of gain with 
frequency can be obtained from the phase angle, and this enables us to obtain 
a third estimate. Suppose that the Nyquist locus is as shown in Figure 1.15, 
entering the M = 1/,/2 circle at — 0.414 +j0, and let us take its phase, between 
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Figure 1.15 Typical Nyquist locus, showing the M=1 I 2, M= J 2 and |S|=1 
circles. 


` the cross-over frequency and the loop bandwidth, to be — 135°. This corre- 
sponds to the gain changing as œ~ +° (— 30 dB/decade), at most. Hence in this 
case œ = 1.8%.. These calculations are approximate, of course, but they do 
illustrate that for typical, acceptable designs we can estimate the loop 
bandwidth in terms of the cross-over frequency by 


0, <0, <20, (1.26) 


In general, we have a second ‘degree of freedom’ because P(s) need not 
be just unity. In this case it is useful to distinguish between the ‘loop 
bandwidth’ and the ‘transmission bandwidth’. 

The loop bandwidth is the bandwidth of 


_ G(s) K(s) 
M3)= 7 GQOK(N) (1.27) 


and indicates the ‘speed of elimination’ of a disturbance. 
The transmission bandwidth is the bandwidth of 


G(s)K(s)P(s) 


1+G(s)K(s) G 


G.(s)= 


and indicates the ‘speed of response’ to the reference signal. P(s) is usually 
used to extend the transmission bandwidth beyond the loop bandwidth, but 
of course this demands increased control energy. 
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EXAMPLE 1.2 


Suppose all three signals acting on the system (r, d and m) are stochastic 
processes with spectral densities ®,(@), (œw) and ®,,(@), respectively. 
Define the tracking error e(t) by 


e(t)=r(t)— y(t) 


Suppose we wish to minimize the variance of the error, E{e*(t)}. From 
equation (1.10) we see that 


e(s)=[1—T7(s)]d(s)+[G,(s)—1]r(s)— T(s)m(s) (1.29) 


If we assume that d, r and m are mutually uncorrelated, and let the 
spectral density of e be ®©, (œw), then we obtain 


®,(@) = |1—T(jo) |? ®,(@) +|1-G.(jo) |? ®,(@) 


+| T(jo) |? ®,,(@) (1.30) 
But 
E{e(t)} -f ®,(@)do (1.31) 


so it is clear that, to keep the error small, we must keep | 1 — T(jw)| small 
when ®,(q) is large, | 7(j@)| small when ®,,(«) is large and | 1—G,(ja) | 
small when ®,(q@) is large. l 

The quality of control is limited by the extent to which the spectra 
of d and m are separated (and their relative magnitudes). 

Once again, the ‘shape’ of a solution becomes clear: | K(jw)| large 
enough to keep |S(ja)| small for 0<@<wo, and | K(ja)| as small as 
possible for @>p, for some frequency wo. 

If P(s)=1 then G.(s)= T(s), and the transmission and loop band- 
widths coincide. Thus the better the system responds to the reference 
signal, the more it will respond to the (misleading) measurement noise. If 
there is significant overlap between ®,(w) and ©,,(q@), the loop bandwidth 
should be kept low, and P(s) (#1) should be used to increase the 
transmission bandwidth. 


a a 2 EES 


The loop bandwidth should be just large enough to give sufficient 
attenuation of disturbances, including those representing uncertainty about 
the plant model. It follows that, if there were no disturbances, or in other 
words if there were no uncertainty about the behaviour of the plant, then the 
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loop bandwidth should be zero; that is, in this case we should have no 
feedback, but should use open-loop series compensation alone. Of course, 
such an ideal situation never occurs, and is very rarely even approached, but 
it serves to illustrate that the purpose of using feedback is to combat 
uncertainty, and is not just to modify a system’s input/output behaviour. 


1.6 Two approaches to design 


From equation (1.15) we see that 
K(s) = G7 1(s)F(s)T7(s) (1.32) 


which suggests an obvious approach to design: we just need to choose either 
F(s) or T(s) to satisfy the requirements described in Section 1.5, and stability 
requirements, and then obtain K(s) from equation (1.32). The feedback loop 
will be stable if F(s) has all its zeros in the open left half-plane, or equival- 
ently, if T(s) has all its poles there. 

The fundamental objection to this ‘closed-loop’ approach is that it 
forces the designer to pick one particular F(s) (or T(s)) from the infinitely 
many which would be equally suitable. This choice is essentially arbitrary, in 
so far as it ignores the characteristics of the plant G(s). It is probable that 
choosing a different F(s) would yield a K(s) which would be preferable, even 
though both may meet closed-loop requirements. For example, one choice of 
F(s) may result in K(s) being unstable, whereas another may not. Stable 
controllers are preferred to unstable ones, primarily because they are easier to 
set up and test before inserting them into the feedback loop. It is virtually 
impossible to determine the most appropriate choice of F(s) directly -that is, 
without examining the open-loop characteristics of the plant. 

If the objective of design is very narrowly defined—‘minimize the 
variance of the error’, for example—then this approach is more attractive, 
since a unique F(s) can be found which meets this objective. The 1950s 
approach to optimal design, based on Wiener filter theory, took this path 
(Newton et al., 1957). But even in such a case the mathematically expressed 
objective is likely to be somewhat artificial, and the designer may well prefer a 
solution which is not quite ‘optimal’ because it is better in a sense which defies 
mathematical description. A more detailed assessment of this ‘closed-loop’ 
approach to design has been given by Horowitz (1963). 

There are also less fundamental, more technical objections to the 
‘closed-loop’ approach. In equation (1.32), K(s) depends on G~ t(s). If G(s) 
has poles or zeros in the right half-plane, unstabilizable (because uncontrol- 
lable or unobservable) modes will be built in to the feedback loop. If F(s) or 
T(s) is chosen so that |F(j@)7T(j@)| approaches zero at high frequency less 
quickly than |G(ja)| does, then K(s) will be unrealizable. These objections 
are not insuperable because it is possible to choose the closed-loop specifi- 
cations so as to overcome them. For example, if F(s) has poles at those points 
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at which G(s) has unstable poles, then these poles will not be cancelled by 
K(s). (See Franklin and Powell (1980) for a fuller discussion.) 

However, if F(s) is obtained as the solution to some optimization 
problem, then the designer does not have the opportunity of ensuring that 
F(s) will have the special features required to give a realistic design. It is, 
nevertheless, possible to proceed with a ‘closed-loop’ approach to design, by 
working in a rather more elaborate framework. The so-called ‘Youla para- 
metrization’ yields a means of solving general feedback-optimization prob- 
lems (most commonly optimizing some closed-loop properties), with the 
assurance that the solution is a realizable controller which does not attempt 
to cancel right half-plane poles or zeros. This parametrization will be 
described in detail in Chapter 6. 

- By far the most common approach to feedback design is to work 
directly with the ‘open-loop’ return ratio G(s) K(s). As we have already seen, 
the designer starts by knowing roughly the characteristics that |G(jw)K( jæ)| 
must have — large up to some frequency, as small as possible beyond that. He 
probably has some idea of the frequency region in which the transition from 
high gain (large |GK|) to low gain should occur, and he knows, from the 
Nyquist stability theorem, how the graph of G(jw)K(ja) should behave in 
relation to the point — 1 + j0. He may have much more precise information 
about the open-loop requirements, which are derived from closed-loop 
specifications. These requirements are usually stated in the following form: 


(1) |G(jo)K(jo)| > L (>1), for 0 < w < o; 
(2) IG(jæ)K(jæ)| <e(«1), for a> œ; 
(3) Gain margin > u, phase margin > ¢; 
( The graph of G(j@)K(jq) is to remain outside a neighbourhood of 
the point — 1 + j0, defined by a particular M-circle (most commonly 


M =J). 


FN 
— 


To some extent (3) and (4) are alternatives. 
These requirements may vary. Commonly met additional require- 


ments are that |G(0)K(0)|= œ, which ensures zero steady-state error in the 
face of step disturbances, or that 


lim @|G(j@)K(jo)| = 0 
a7>0 


which ensures zero steady-state error in the face of ramp disturbances. These 
two are the most common examples of the application of a general principle 
which states that the long-term error in the face of a persistent disturbance 
(that is, one which does not decay to zero with time) can be zero only if the 
poles of (the transform of) the disturbance are included among the poles of 
the return ratio G(s)K(s). In the past fifteen years this principle has been 
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shown to hold in very general situations, and has been called the ‘internal- 
model principle’ (Astrém and Wittenmark, 1984; Wonham, 1974), although 
its essence was known to the pioneers of feedback design methods. 

Another departure from the standard form of specifications which can 
occur is that the required gain-frequency behaviour may not be monotonic 
decreasing. For example, it may be necessary to reduce the loop gain in the 
region of a resonance, but to keep it large at both lower and higher 
frequencies. As another example, it may be necessary to keep the loop gain 
almost constant over a range of frequencies if the plant has a zero in the right 
half-plane. (In such cases terms like ‘loop bandwidth’ should be used with 
care.) 

The attraction of working with the open-loop rather than the closed- 
loop transfer functions is that it is easy to see what features K(j@) should 
have if G(jw)K (ja) is to satisfy the requirements, and it is usually relatively 
easy to find a transfer function K(s) which has these features. Furthermore, 
the design can be pursued in stages: for example, meeting high-frequency gain 
reduction requirements first, then stability margins, and finally low-frequency 
gain enhancement. Usually such stages ‘interfere’ with one another, and it 
may be better to think of each stage as giving a better approximation to the 
required characteristics of G(s)K(s). (Breaking the design into stages corre- 
sponds to obtaining K(s) as a product, each factor being the transfer function 
obiained at each stage.) 

Graphical displays of G(j@)K(ja), in the form of Nyquist, Bode or 
Nichols plots, enable the designer to make a rapid visual assessment of the 
degree to which the requirements have been met, and, with a little practice, to 
judge the seriousness of any violations of the requirements. 


1.7 Limitations on performance 


The question arises of whether it is always possible to meet specifications for 
G(ja@)K (jæ) by using some K(s). It turns out that it is not possible, because 
the gain-frequency behaviour and phase-frequency behaviour are not 
independent of each other. 


1.7.1 Gain-phase relationships 


Suppose that Q(s) is a transfer function which has all its poles and zeros in the 
open left half-plane. Such a transfer function is called minimum-phase (for 
reasons which will become clear in Section 1.7.2). Also suppose that Q(0) > 0 
(and, of course, that Q(0) is real), and let 


L(¢) =log| Q(39)| (1.33) 


(£) = arg Q(j¢) (1.34) 
A=log({/a) (1.35) 
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It was shown by Bode (1945) that 
1 | ° dL(¢) 


ġo) =z 2 da 


logcoth|A/2| dA (1.36) 


Noting that the function logcoth|A/2| behaves approximately like the im- 
pulse 42?6(A), we obtain 


ndl 


(oy 5 


(very approximately) (1.37) 


From this, the usual ‘Bode plot’ rule of thumb follows: if the gain falls 
at a sustained rate of 20dB/decade of frequency (which corresponds to 
dL/dj = — 1), then the phase at those frequencies will be — 2/2 rad. If the gain 
falls at 40 dB/decade, the phase will be — rad, and so on. Our assumption 
that Q(0) >0 may be incorrect, in which case equation (1.36) gives the phase 
change which occurs between frequencies 0 and œ. If the transfer function has 
zeros in the right half-plane, then the phase will be more negative than is 
indicated by equation (1.36). 

We have seen that the solution to the archetypal feedback design 
problem is of the form ‘use high loop gain over some frequency range, then 
decrease the loop gain as rapidly as possible’. If only gain mattered, we could 
get arbitrarily fast cut-off of loop gain just by cascading large numbers of low- 
pass filters. But we need to maintain the stability of the loop. To do this we 
cannot have excessive phase lag at the frequency at which |K(jw)G(j)|=1 
because of Nyquist’s stability theorem. But we know from the gain—phase 
relationships that this restricts the magnitude roll-off rate: there is a phase lag 
of at least 90° for each 20 dB/decade of roll-off rate. 

For an open-loop stable, minimum-phase plant, we can say that the 
roll-off rate in the region of the gain cross-over frequency (that is, where the 
loop gain is unity) must not exceed about 40dB/decade. For a reasonable - 
stability margin it must be smaller than this. 

Note that this is a ‘local’ and rather approximate result, which can 
usually be used only to tell whether a specification is easy or difficult to meet. 
It is not usually possible to say that a specification is impossible to meet 


without doing a detailed design study. Consider the following (extreme) 
examples: 


EXAMPLE 1.3 
loop gain > 33, a<l 
loop gain < 1/33, w >10 


The average slope required is — 60 dB/decade, which implies a phase lag of 
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— 80 dB/decade 


Loop gain (dB) 


— 20 dB/decade 


Phase (deg) 


l | 
SEPA 
Not enough ‘space’ for 
phase angle to advance 


Figure 1.16 Bode plot with unattainable gain specification. 


270° (average) over one decade of frequency. We could try shaping the 
loop gain as shown in Figure 1.16, but we find that the region over which 
the gain falls at only 20dB/decade is too small to allow the phase to 
increase significantly. 


EXAMPLE 1.4 
loop gain > 33 x 10° for œ < 0.001 
loop gain < 1/33x 1076 for œ > 100 


The average slope required is — 60 dB/decade, as before, but now this is 
spread over five decades, and the previous strategy has a chance of success 
(because the —20dB/decade section can be made to last for, say, three 
decades). But note that such a design could only be conditionally stable: a 
reduction of the loop gain could make the loop unstable. 
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1.7.2 Right half-plane zeros 


Suppose that G(s) is stable, and has one real, right half-plane zero, at s=a. 
We shall show that the achievable performance of a feedback loop is 
then limited, in the following sense. Suppose that |K(jw)G(j@)| >y>1 
(0 < œ < wo). Then the open-loop gain—bandwidth product ya, is, in practice, 
limited. Furthermore, an approximate practical upper bound on œ, is w, < a. 

Note that these limitations occur even without constraints on u(t), 
and without the presence of measurement noise; also, œo is an open-loop 
bandwidth, but the upper bound on @y in effect puts a similar bound on @,, 
the loop bandwidth. 

We begin by writing 


G(s) = G(s) A(s) (1.38) 


where A(s)=(s—a)/(s+a) is an all-pass function (that is, |(j@—a)/ 
(jæ + a)| = 1, while its phase decreases). Its Nyquist diagram is shown in 
Figure 1.17, while Figure 1.18 shows how the phase of A(s) varies with 
frequency. Suppose that the gain cross-over frequency is œ, = a, and let us 
make the unrealistic assumption that we could tolerate zero phase margin. 
Then since 


arg A(ja)— arg A(0) = —4n 


we could allow arg K(jw)Go(jm) to change by —4z at most between œ = 0 
and œ =a. From the gain-phase relations we know this to mean that 
|K(j@)Go(j@)| must be falling at a rate of 20 dB/decade at most in the region 
of w = a. 

Figure 1.19 shows some possible (y,@) specifications. It will be seen 
that for a/10 < wy < a, roughly, the gain—bandwidth product is limited to 


yoo Ka (1.39) 


Of course, allowing for a realistic phase margin would reduce this further. 


Figure 1.17 Nyquist diagram of the all-pass transfer function (s— a)/(s+ a). 
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Phase (deg) 


Figure 1.18 Phase-frequency characteristic of (s— @)/(s+ a). 


If the cross-over frequency is reduced to w, = a/2, then A(s) contri- 
butes only 53° of phase lag there, so the slope of |KG | can be increased to 
about 28 dB/decade (still with zero phase margin) and hence we can achieve 
YOo <S 1.25a. 

In fact, yw, can be increased indefinitely by pushing œw, to a sufficiently 
low value. In other words, right half-plane zeros do not cause difficulties if 
they lie well outside the desired (closed-loop) bandwidth. 

Now suppose that we raise the cross-over frequency to @, = 2a. 
A(j@,) contributes 127° of phase lag, which leaves 53° for arg(KGo). 
In practice most of this would be needed for the phase margin, leaving 
a very shallow slope for |KG,|. However, for comparability with previous 
figures we assume again that the phase margin is zero. Then |KGo| can fall 
at 12 dB/decade, allowing 


YWo < 0.4a 
IK (j)Go (jev)| 
N, 
ba 

10 Yı 
l — 20 dB/decade 
| 

, | N g 


Figure 1.19 Some possible (open-loop) gain and bandwidth specifications when a 
zero is located at a in the right half-plane. 
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Table 1.1 Effects of right half-plane zero located at a>0. 


: ; : d 
wo, arg A(0)—arg A(jo,) —arg[K(jo,)Go(jo)] ae (9®0)max 
la 87° —19dB/decade 0.93a 
a 90° 50° —11dB/decade 0.36a 
2a 127° 13° —2.9dB/decade 0.14a 
4a 152° impossible — — 


With a phase margin of 40° the figures are as shown in Table 1.1. It is seen 
that œ, is limited to approximately œ. < a. 

When there is more than one zero in the right half-plane, the 0 dB 
cross-over frequency is similarly limited. (For a full discussion see Horowitz 
(1963).) 

As we said earlier, the loop bandwidth is usually very close to the 0 dB 
cross-over frequency. But this need not be the case if the plant has right half- 
plane zeros, so these zeros do not limit the loop bandwidth directly. For 
example, if we have a plant with transfer function 


l s-1 
= 1. 
G(s) =- +2 (1.40) 
and compensator 
—20(s + 2) 
K(s) = ——-——— i; 
(= scaly nag 


then we obtain a cross-over frequency of approximately 0.7 rads~'. How- 
ever, the closed-loop transfer function T(s) is 


_ 20(1—=s) 
ENET oa 
so that 
PEAS IER.. 
|T'(jo)| = ls aa (1.43) 


and we have a loop bandwidth of 20 rad s~ +. The sensitivity function S(s) for 
this design, however, is 


_  s(s+4i) 
S(s) = + 6420) (1.44) 
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so that |S(jw)| > 1 for œ > 0.35rads~', approximately. Between the fre- 
quencies 0.35 and 20 rad s~ ', then, this design amplifies disturbances without 
attenuating measurement noise, and is therefore extremely inefficient. If a 
two-degree-of-freedom structure is permissible, then a much better design is 
obtained by limiting the loop bandwidth to about 1 rads” +, since no benefit 
is obtained by going beyond this, and using a pre-filter to extend the reference 
signal transmission bandwidth to 20rads~', if that is really required. 

We see, then, that right half-plane zeros in fact limit the loop band- 
width for engineering rather than mathematical reasons. 


1.7.3 Right half-plane poles 


Rather surprisingly, the presence of unstable poles imposes a lower bound on 
the cross-over frequency. Suppose that a system has one real unstable pole 
located at s = a, and factorize the transfer function as 


G(s) = Go(s)A ~} (s) (1.45) 


where A(s) = (s—a)/(s + a), as before. Since Go(s) has all its poles in the left 
half-plane, and has at least as many poles as zeros, it is impossible for its 
Nyquist locus to make any anticlockwise encirclements of the point — 1 + jO 
(or of any other point). But we need one such encirclement if the closed loop is 
to be stable (by Nyquist’s theorem), and clearly we can only get this from the 
phase characteristic of A~*(ja). 

If we further suppose that |G(j@)|, and hence |Go(j@)|, is monotonic 
decreasing with respect to œ, then arg Go(ja) — arg Go(0) is always negative 
(by equation (1.36)). Consider Figure 1.20, which shows a possible Nyquist 
locus of G(s), with a phase margin @¢. If œ, denotes the 0dB cross-over 
frequency, then 


arg G(jw,) — arg G(0) = @ (1.46) 
But arg G(jw) = arg Go(jæ) + arg A7 + (jæ), so 
arg G(jo,) — arg G(0) = arg Go(jw.) — arg Go(0) 


+argA ‘(jw,)—arg A~ *(0) 


<argA'(j@.)—2 
Hence g Go.) 


argA‘(jo,)>o+n (1.47) 
Thus from Figure 1.17 (which shows the Nyquist diagram of A(s)), we see 


that ¢ = 7/4 implies that œ, > a/2 (approximately), @ = 2/2 implies that 
w, > a, and so on. 
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Figure 1.20 Possible Nyquist locus of open-loop unstable system. 


Thus we conclude that the bandwidth of the closed-loop system must 
be approximately a, at least, if the |G(ja)| characteristic has the usual shape, 
namely a monotonic decreasing function of frequency. As is to be expected, 
this lower bound for the bandwidth is increased by the presence of other right 
half-plane poles. 


1.7.4 Bode’s integral theorem 


Figure 1.21 shows the Nyquist locus of some system. We see that, for 
w, <@<@p, |S(j@)| > 1, in other words there is a frequency region in 
which feedback would increase the sensitivity of the system rather than 
decrease it. It is easy to see that for any system which is open-loop stable, and 


Figure 1.21 Nyquist locus of a system, with part of |S|=1 circle. 
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Figure 1.22 Nyquist locus of system which has two unstable open-loop poles, but 
is closed-loop stable. 


which has a high-frequency roll-off steeper than —20dB/decade, such a 
region must always exist since the gain—phase relations tell us that the 
Nyquist plot of such a system must be heading towards the origin at a phase 
angle more negative than —4 n; and Nyquist’s stability theorem says that, if 
this phase angle is more negative than —z, then the locus must cross the 
negative real axis between — 1 and 0. Thus the locus is bound to enter the unit 
circle which is centred on —1. 

If the system is open-loop unstable, the same is true. Figure 1.22 shows 
a possible locus for a system with two open-loop unstable poles (and roll-off 
steeper than —20dB/decade). Again, the gain—phase relations are such that 
the locus is bound to penetrate the circle. 

The reader may think that it may be possible to devise a system whose 
Nyquist locus is shaped in such a way as to avoid penetrating the circle, but 
this was shown by Bode (1945) to be untrue. The following theorem was first 
stated by Bode for open-loop stable systems, and extended to the form given 
here by Freudenberg and Looze (1985) (see also Freudenberg and Looze, 
1987): 


Theorem 1.2: Suppose that G(s)K(s) has the set of open right half- 
plane poles {p;: i= 1,..., N} (including multiplicities), and that 


lim sup R|G(s)K(s)|=0 (‘roll-off condition) (1.48) 
R->o |s|>R 
Re(s)> 0 


and let S(s) = [1+ G(s)K(s)]7t, as before. If the closed-loop is 
stable, then 


[[ misgoldo =z 5 Re(p;) (1.49) 
0 iTi 
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This theorem has the following consequences: 
(1) For open-loop stable systems, 


[/ tnistioyide =0 (1.50) 


0 


That is, in a sense there is as much sensitivity increase as there is 
decrease. If the sensitivity (in decibels) is plotted against frequency 
(linear scale), then the positive area under the graph will equal the 
negative area. 


(2) For open-loop unstable systems the position is worse. There is now 
more positive than negative area. Fast (open-loop) unstable poles are 
more harmful than slow ones. Intuitively, one can perhaps think of 
some of the feedback as being ‘used up’ in shifting poles into the stable 
region, and only part of it being ‘available’ for the reduction of 
sensitivity. This gives a theoretical justification for preferring (open- 
loop) stable controllers — but there are also powerful practical reasons 
for this preference. 


(3) If one pushes the sensitivity lower at some low frequencies, the penalty 
is a higher peak value of the sensitivity at some higher frequency. This 
is not apparent from the theorem, since one might think that the 
corresponding increase required in the positive area could be obtained 
by making an arbitrarily small increase in S(jm), spread over an 
arbitrarily large frequency range. Freudenberg and Looze (1985) show 
that this is not so. In fact, if 


M 
then 
| ndodos =i- (1.52) 
z k 1i—e 


c 


which tends to zero as € tends to zero, so that most of the required 
positive area must be acquired at frequencies below @,, if € is very 
small. 


$e U 
SUMMARY 


In this chapter we have reviewed the analysis and design of single-loop 
feedback systems. We have considered the possibilities of both ‘closed-loop’ 
design, in which a desired closed-loop transfer function is somehow estab- 
lished, and a controller is then synthesized automatically, and ‘open-loop’ 
design, in which the plant characteristics are taken account of explicitly. 
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We have shown that there are fundamental trade-offs between the 
conflicting objectives of reducing sensitivity to disturbances and parameter 
uncertainty on the one hand, and filtering out any internally generated 
noise on the other. It is the existence of these trade-offs which makes 
feedback design difficult. The standard approach to reconciling the con- 
flicting objectives relies on making the loop gain high at low frequencies, 
then reducing it as quickly as possible at high frequencies. However, the 
rate of gain reduction is limited by the need to maintain stability of the 
feedback loop, since a large rate of gain reduction is possible only with a 
large associated phase lag, and the allowable phase lag is limited by 
Nyquist’s stability theorem. 

The location of any zeros or poles in the right half-plane restricts the 
range of frequencies over which the use of feedback can be beneficial: right 
half-plane zeros impose an upper bound on the range of frequencies over 
which the sensitivity can be reduced, while right half-plane poles impose a 
lower bound on the range of frequencies over which noise must be passed 
without attenuation (the loop bandwidth). 

We shall see that all of these considerations continue to hold for 
multivariable feedback systems. 


EXERCISES 


tS 


1.1 The peak phase change given by a simple first-order phase-lead or 
phase-lag compensator is given by equation (1.7). Plot a graph of this 
peak phase change for Í < « < œ. 
In practice, various considerations usually limit the range of a 
to 0.1 <a < 10. What is the practical peak phase change obtainable 
with such a compensator? 


(Solution: 55°) 


1.2 A plant has transfer function 1/ s?. Design phase-lead compensators 
that will give a 40° phase margin, and a closed-loop bandwidth of 
(a) 5 Hz, (b) 0.02 Hz. Compare the closed-loop transient responses of 
the two designs. 


1.3 Let 


1.4 


1.5 


1.6 
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and let w, denote the cross-over frequency, at which |G(ja,)| = 1. If ¢ 
is the phase margin, show that 


1 
Tijo) = ——— 
Hence show that a specification framed in terms of M-circles impli- 
citly specifies minimum gain and phase margins. What gain and phase 
margins are guaranteed if the Nyquist locus remains outside the 
M =.,/2 circle? 


(Solution: 4.6 dB, 41°) 


Suppose that 


s—3 5 
G(s) = s41 and T(s) = 3+5 


Find the corresponding compensator K (s), if 


_ G(s)K(s) 
i ake G(s)K(s) 


Obtain the corresponding Nyquist locus. Does it predict stability? 
Show that the closed loop is in fact unstable, because of a hidden 
unstable mode. (Hint: this is most easily done by using state-space 
models.) 

Suggest a T(s) which leads to a stable feedback loop, but still 
has a bandwidth of 5rads~*. Examine the corresponding sensitivity 
function, and discuss the advisability of using a two-degrees-of- 
freedom design. 


A plant has a zero at s=3 and a transport delay of 1.0s. 
Estimate the highest achievable 0 dB cross-over frequency, assuming 


that the open-loop gain-frequency characteristic is monotonic 
decreasing. 


(Solution: 2 rad s~1) 


Give an example of a plant for which it is impossible to find a feed- 
back design which is ‘good’ in the sense of giving stability, 
sensitivity reduction at low frequencies and noise attenuation at high 
frequencies. 


re rp 
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2.1 Introduction 


We shall take the basic description of a linear multivariable system to be a 
transfer-function matrix. This is simply a matrix G(s) of transfer functions, in 
which the (i, j) element g,,(s) is the transfer function relating the ith output to 
the jth input. If each element gij(s) is rational, and proper, namely if 


|gij(s)| > |d,;| < 00, as |s| > œ (2.1) 

then one can find a state-space model 
x = Áx + Bu, y = Cx + Du (2.2) 
37 
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(a) 


Figure 2.1 Connections of two systems: (a) in series, (b) in parallel, (c) negative 
feedback. 


in which u is the vector of system inputs, y is the vector of outputs and x is a 
state vector. A, B, C and D are real matrices of appropriate dimensions, and 
are related to the transfer function G(s) by 


G(s) = C(sl —A)"1B+D (2.3) 


The 4-tuple (A,B,C, D} is said to be a realization of G(s), and we shall 
sometimes write G(A, B, C,D) to denote this. Note that a realization is not 
unique, because (A, B, C, D) is also a realization if T is any invertible matrix 


and 
ĀÃ=T AT, B=T"'B, C=CT (2.4) 


Transmission paths through the system often contain time delays; the 
elements of G(s) are then transcendental functions, and no state-space 
realization exists. This is inconvenient, because many of the procedures for 
analysing and designing multivariable systems are best implemented by 
algorithms which operate on state-space models, but it is not disastrous 
-many of the procedures can be implemented by operating directly on 
frequency-response data, namely on matrices such as G(ja). Alternatively, 
transfer functions of time delays (ec ““, where d is the delay duration) can be 
approximated by rational functions. 
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Consider two systems, defined by transfer-function matrices G,(s) 
(i = 1,2), their input signals being the vectors u; (i = 1,2) and output signals 
the vectors y; (i = 1,2), so that 


yi(s) = G,(s)u,(s), i= 1,2 (2.5) 


If the dimensions of y, and u, are the same, then we can form the series 
connection as shown in Figure 2.1(a), and obtain 


Y2(s) = Gz (s)G,(s)uz(s) (2.6) 


(Note that the order of multiplication is important because matrix multipli- 
cation is not commutative.) 

If both systems have the same number of input signals and the same 
number of output signals (and hence both transfer-function matrices have the 
same dimensions) then we can form the parallel. connection as shown in 
Figure 2.1(b), and obtain the relation 


y(s) = [Gi (s) + G(s) ]u(s) (2.7) 
If the dimensions of y, and u, are the same and the dimensions of Uy 


and y, are the same, then we can form the feedback connection shown in 
Figure 2.1(c). In this case we have 


yi (s) = G,(s)[u(s) — G,(s)y,(s)] (2.8) 
or 

[I + G,(s)G2(s)]y,(s) = G,(s)u(s) (2.9) 
so that 

yi(s) = [I + G,(s)G2(s)]~"G, (s)u(s) (2.10) 


Alternatively, by starting with 

u, (s) = u(s) — G2 (s)G,(s)u, (s) (2.11) 
we can derive 

yi (s) = Gi (s)[I + Gz (s)G,(s)]~ tu(s) (2.12) 


Note that (2.10) and (2.12) are alternative expressions for the same closed- 
loop transfer function. If we have a negative-feedback connection as 
in Figure 2.1(c), then we call the products ~G,(s)G,(s) and —G,(s)G,(s) 
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return ratios, and the matrices J + G,(s)G,(s) and J + G,(s)G,(s) are called 
return differences (MacFarlane, 1970). Note that their dimensions are not all 
the same, unless both G,(s) and G,(s) are square. (If a positive-feedback 
convention is used, then the return ratios are G,G, and G, G3, and the return 
differences are I — G, G, and I — G,G,.) 


2.2 The Smith-McMillan form of a transfer-function 
matrix 


In order to generalize the ideas and results of Chapter 1 to multivariable 
feedback, we shall need to find appropriate generalizations of poles and zeros. 
We shall do this by showing that every rational transfer-function matrix can 
be reduced to a diagonal canonical form known as the Smith-McMillan 
form, and then defining poles and zeros in terms of this Smith—-McMillan 
form. 

Every rational transfer-function matrix can be expressed as a poly- 
nomial matrix, divided by a common denominator polynomial. We therefore 
begin by showing that every polynomial matrix can be reduced to a canonical 
form known as the Smith form. 


Definition 2.1: A polynomial matrix U (s) is called unimodular if it 
has an inverse which is also a polynomial matrix. 


Theorem 2.1: U(s)is a unimodular matrix if and only if det U (s) is a 
constant (independent of s). 


Proof: If U(s)is unimodular, then it has a polynomial matrix inverse 
U(s)~1 such that U(s)U(s)~* = I. This implies that 
det [U(s)]det[U(s)"*] = 1 


Since both det [U(s)] and det [U(s)~'] are polynomial expressions, 
they must be constants. Conversely, suppose that det [U(s)] =cisa 
constant. Then 


1 
U(s)"* = [adj U(s)] 


where adj U(s) denotes the adjoint of U(s). Since the adjoint matrix is 
a polynomial matrix, then so is U(s)7t. E 


There are three elementary operations which can be performed on 
polynomial matrices: 


e interchange of two rows or columns 
e multiplication of one row or column by a constant 
e addition of a polynomial multiple of one row or column to another 
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Each of these elementary operations can be represented by multiplying a 
polynomial matrix by a suitable matrix, called an elementary matrix. It is 
easy to show that all elementary matrices are unimodular. 

We say that two (polynomial or rational) matrices P(s) and Q(s) are 
equivalent (symbolized P(s) ~ Q(s)) if there exist sequences of left and right 
elementary matrices {L,(s),...,L,(s)} and {R,(s),.. .»R,(s)} such that 


P(s) = L,(s). . .L,(s)Q(s)R,(s). . .R,(s) (2.13) 


The next result states that every polynomial matrix is equivalent to a 
diagonal polynomial matrix known as the Smith form. 


Theorem 2.2: Let P(s) be a polynomial matrix of normal rank r (i.e. 
of rank r for almost all s). Then P(s) may be transformed by a 
sequence of elementary row and column operations into a pseudo- 
diagonal polynomial matrix S(s) having the form 


S(s) = diag {¢,(s), @.(s),...,¢,(s),0,0,.. .,0} (2.14) 


in which each ¢,(s) (i=1,...,r) is a monic polynomial (i.e. has 
leading coefficient 1) satisfying the divisibility property 
&(S)|& 4 1(s), i=1,...,r—1 


(that is, ¢;(s) divides ¢;,,(s) without remainder). Moreover, if we 
define the determinantal divisors 
Do(s) = 1 


D;(s) = greatest common divisor of alli xi > (2.15) 
minors of P(s) 


where each greatest common divisor is normalized to be a monic 
polynomial, then 


D;(s) 
D; -i (s7 


&,(s) = RN 2 (2.16) 


The matrix S(s) is the Smith form of P(s), and the ¢;(s) are called the 
invariant factors of P(s). 


Proof: The reduction to Smith form can be carried out in the 
following steps: 


(1) From all the non-zero elements of P(s), choose one which has 
least degree and bring it to the (1, 1) position by permuting the 
rows and columns. 
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(2) 


(4) 
(5) 


Write the element p,,(s) (if non-zero) as p21(s) = 9(s)P11(S) 
+r(s), where r(s) is either zero or such that 
deg r(s) < deg pı; (s). Subtracting q(s) times the first row from 
the second row, the (2, 1) element becomes r(s). If r(s) is zero, 
proceed to the next step; otherwise interchange rows 1 and 2 
and repeat the present step. Since the degree of the (1,1) 
element must decrease by at least one every time we repeat this 
step, we must end up with a zero in the (2, 1) position. 

In a similar manner to that described in step 2, use elementary 
row operations to make zero every element of the first column 
except the (1, 1) entry. 

By means of column operations, reduce every element in the 
first row to zero except the (1, 1) entry. 

Step 4 may, however, have reintroduced non-zero entries be- 
low the (1, 1) position in the first column. If this has happened, 
return to the beginning of step 2. However, on each cycle 
through the steps 2, 3, 4, 5 and 2, the degree of the (1, 1) element 
drops, so the process must terminate with a matrix of the form 


0 ! e(s) f(s) ... g(s) (2.17) 


h(s) k(s) book I(s) 


If any element of columns 2, 3,. . . is not divisible by a(s), we 
add this column to the first column and then go back to the 
beginning of step 2. Again, on each passage through the cycle 2, 
3, 4, 5, 6 and 2, there must be a reduction in the degree of the 
(1,1) element. So the process terminates with a matrix of the 
form (2.17) in which a(s) divides every other non-zero element. 
We may assume that a(s) is monic, and put é,(s) = a(s). 


Now delete the first row and first column of the matrix and 


repeat the above procedure on the remaining submatrix. We then get 
a second matrix equivalent to P(s): 


e, (s) 0 | O ... 0 

0 £(s) 0 0 
eee O i w(s) ..- x(s) 
00 Fs)... 268) 
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in which ¢,(s)|e,(s)| {w(s),x(s),. . .» ¥(s),2(s)}. It is clear that the 
procedure may be repeated until we get the Smith form of P(s). 

The fact that there are r non-zero invariant factors follows 
from the observation that elementary operations do not change the 
rank of a matrix. The second part of the theorem follows from the 
Binet-Cauchy theorem (Gantmacher, 1959). The D;(s) are invariant 
under elementary operations, and hence the determinantal divisors of 
P(s) and S(s) are the same. Hence we have that 


D,(s) = e, (s) 


D,(s) = e, (s)ez(s) 


D,(s) = €,(s)é(s). . .&,(S) 


from which equation (2.16) follows. E 
EXAMPLE 2.1 
Let 

1 —1 
P(s)=| s*+s—4 2s?—s-8 (2.18) 
s? —4 2s? — 8 

then 

Do(s)=1 

D, (s) = ged {1, —1,s? + s—4, 2s? —s—8,s*—4, 25? 8}=1 
(where ‘gcd’ means ‘greatest common divisor’) 

1 —1 1 -1 
D = gcd 
a(s) = go | sS?+s—4 252-58 | s?~—4 257-8 | 
s?+s—4 252-5—8 
s*—4 2s? —8 
= (s+ 2)(s — 2) 

so that 


é,(s)=1 and €2(s) = (s+ 2)(s—2) 
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It follows that 


1 0 
P(s) ~ S(s)=| 0 (s+2)(s—2) (2.19) 
0 0 


It is clear that the Smith form of a polynomial matrix is uniquely 
defined by equation (2.16), and that two equivalent polynomial matrices have 
the same Smith form. The Smith form is thus a canonical form for a set of 
equivalent polynomial matrices. 

Now we extend the idea of a canonical form to rational matrices. 


Theorem 2.3 (Smith-McMillan form): Let G(s) be a rational matrix 
of normal rank r. Then G(s) may be transformed by a series of 
elementary row and column operations into a pseudo-diagonal 
rational matrix M (s) of the form 


BST (s) £2(5) &,(s) 
Ms) = diag] CL, EO, EST OS hea § o} (2.20) 


in which the monic polynomials {¢,(s),,(s)} are coprime for 
each i (i.e. they have no common factors) and satisfy the divisibility 


properties 
€;(s)|&+1(S) ; 
=1,...,r-—1 2.21 
on eee ee 


M(s) is the Smith-McMillan form of G(s). 
Proof: Let d(s) be the least common multiple of all the denominators 
of the elements g,,(s) of G(s). Then G(s) can be written as 


1 
G(s) = uP) (2.22) 


where P(s) is a polynomial matrix. By Theorem 2.2, P(s) is equivalent 
to its Smith form: 


P(s) ~ S(s) = diag {¢4(s),. . eves), 0-350} (2.23) 


It follows that 


G(s) ~ rene = M(s) (2.24) 
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where each diagonal element &;(s)/W;(s) of M(s) is obtained by 
performing all possible cancellations between the numerator and 
denominator of ¢;(s)/d(s). 


The divisibility property (2.21) is a direct consequence of the fact that 
ei(s)le; +1 (8), i=1,...,r—1 E 


EXAMPLE 2.2 


Let 1 — 1 
s? +3s+2 s? +3s+2 


s? +s—4 2527-58 


ee) S? +3s+2 52743542 
s—2 2s —4 
stl s+1 
Then 
l 1 
— 2. 
ao) s? +3542 EO) 225) 


where P(s) is the polynomial matrix of Example 2.1, with Smith form 
given by (2.19). Proceeding as in the proof of Theorem 2.3, we find that the 
Smith-McMillan form of G(s) is given by 


1 
i (s+1)(s+2) j 
0 0 


2.3 Poles and zeros ofa transfer-function matrix 


In single-input/single-output (SISO) systems, the poles and zeros of the scalar 
transfer function 


_ nės) 
(s a (2.27) 


(with n(s) and d(s) coprime polynomials) are given by the roots of d(s) and 
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n(s), respectively. We now define the poles and zeros of a transfer-function 
matrix by means of the Smith-McMillan form.' 


Definition 2.2: Let G(s) be a rational transfer-function matrix with 
Smith-McMillan form M(s), as in (2.20), and define the pole poly- 
nomial and zero polynomial l 


p(s) = Y1 (s). - Cs) (2.28) 
z(s) = £, (8). . -& (8) (2.29) 


The roots of p(s) and z(s) are called the poles and zeros of G(s), 
respectively. 


In other words, the poles of G(s) are all the roots of the denominator 
polynomials y;(s) of the Smith-McMillan form of G(s). If po is a pole of G(s), 
then (s — po)” (v > 1) must be a factor of some w;(s). The number v is called 
the multiplicity of the pole, and if v = 1 we say that po is a simple pole. Zeros 
and their multiplicity are defined similarly, in terms of the numerator 
polynomials ¢,(s) of the Smith-McMillan form. 


EXAMPLE 2.3 


The pole and zero polynomials of the transfer-function matrix G(s) of 
Example 2.2 are 


p(s) = (s+ 1)?(s +2) 
z(s) =(s—2) 


Hence G(s) has poles at {—1,—1, —2} and a zero at 2. Since 
W1(s)=(st1)(s+ 2) and w,(s) = s +1, all the poles are simple. 


Corollary 2.1: If G(s) is square, then . 


det G(s) = 5 
p(s) 
for some constant c. Note that although the pair of polynomials 
{e(s),Wi(s)} is coprime for each i, it is possible that there exist 
common factors between z(s) and p(s) which cancel out in forming 
det G(s). 
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Definition 2.3: The degree of the pole polynomial p(s) is the 
McMillan degree of G(s). 


Zeros defined via the Smith-McMillan form are often called trans- 
mission zeros, in order to distinguish them from other kinds of zero which 
have been defined. It is clear from equation (2.20) that the rank of G(s) drops 
below its normal value whenever s = Zp, if zp is a zero of G(s). Hence there 
exists a non-zero vector uo such that G(Zo)}uo = 0. If the input-signal vector 
has transform 


u(s)= —2 (2.30) 
AY = Zo 
then the output is given by 

y(s) = G(s)u(s) + initial-condition response (2.31) 

R.ulp; 
z Glao a Sv RUP agonian rone. G 

S—Zo «FSD; 

= 0 


if the initial conditions are chosen so as to cancel out the second term in 
(2.32), in which p; denotes a pole of G(s), R; denotes the residue of G(s) at 
s = p; and all the poles are assumed to be simple. Hence transmission zeros 
have a transmission-blocking property. (For further details of this interpret- 
ation of zeros, and an analogous interpretation of poles, see Desoer and 
Shulman (1974).) 

Note that the locations of poles can be determined simply by examin- 
ing the elements of the transfer-function matrix—poles occur only at those 
points at which the elements have poles. Their multiplicities, and even their 
numbers, cannot be determined so easily, however. It is sometimes important 
to establish their numbers — for example, if there exist right half-plane poles, it 
is necessary to know how many of them there are before applying Nyquist- 
like stability theorems. The situation is worse with zeros: one cannot establish 
even the location of zeros by inspection of the transfer-function matrix 
elements. 

Although the proofs of Theorems 2.2 and 2.3 provide an algorithm for 
finding poles and zeros, it is not a practical one for use with real systems. 
Reliable and practical algorithms require a state-space realization as a 
starting point; such algorithms are described in Chapter 8. 

The significance of poles may be summarized very simply. Each pole p; 
of a transfer-function matrix G(s) must also appear as a pole of at least one of 
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its elements. It is therefore possible to write G(s) in ‘partial fractions’ as 


y 


G; 
Sog 2, (s— DF a 


(assuming that G(s) is proper), where G; and Go are constant matrices, and k; 
is some positive integer. Hence the impulse-response matrix of the system, 
which is obtained as the inverse Laplace transform of G(s), is 


G(t) = 2 G:t" terit + Goô(t) 


The relation of pole locations to system stability is therefore the same as for 
SISO systems: a system is asymptotically stable if Re { p;} < 0 for each i, and 
it is stable if Re { p;} < 0 and k; = 1 whenever Re { p,} = 0. 

We have considered only rational transfer functions; in practice, the 
relation of pole locations to system stability is the same even when elements 
of the transfer-function matrix are irrational (Callier and Desoer, 1982; 
Rosenbrock, 1974). 


2.4 Matrix-fraction description (MFD) of a 
transfer function 


Throughout this section, G(s) will denote an m x l strictly proper, rational 
transfer-function matrix, namely one such that G(oo) = 0. Let L(s)~' and 
R(s)~*' be the unimodular matrices that take G(s) to its Smith-McMillan 
form M(s): 


G(s) = L(s)M(s)R(s) = Lis)diag | ae ay T obro 
(2.33) 
We may write M(s) as 
M(s) = N'(s)D'(s)~* (2.34) 
where N’(s) and D’(s) are polynomial matrices defined as 
Ba at a A 039 


and D'(s) is a square matrix of dimension l x I. 
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Substituting (2.34) into (2.33) yields 


G(s) = L(s)N‘(s)D’(s)"* R(s) 
= [L(s)N’(s)][R(s)"*D’(s)]~* 


= N(s)D(s)! (2.36) 
where 


N(s) = L(s)N‘(s) and D(s) = R(s)~'D'(s) (2.37) 


Note that since R(s) is unimodular, R(s)~' is a polynomial matrix and so 

both N(s) and D(s) are polynomial matrices. We have therefore expressed 

G(s) as a ‘fraction’ of two polynomial matrices. The representation (2.36) for 

G(s) is called a right matrix-fraction description, and N (s) and D(s) are called 

the numerator matrix and the denominator matrix, respectively, of the MFD. 
Note that in the MFD (2.36), 


(1) z is a zero of G(s) if and only if N(z) loses rank 
(2) p is a pole of G(s) if and only if D(p) loses rank 


Another way of stating 2 is that the pole polynomial is given by the 
determinant of the denominator matrix. 


Evidently, an MFD representation is not unique. For example, if X (s) 
is invertible, then 


G(s) = [N(s)X(s)] [D(s)X(s)]? (2.38) 

= N(s)D(s)~1 (2.39) 

so that (N(s), D(s)) is also an MFD of G(s). We are frequently interested in 
removing any unnecessary common factors, such as X(s), and we therefore 


make the following definition: 


Definition 2.4: Let N(s) and D(s) be polynomial matrices with the 
same number of columns. If there exist N(s) and D(s) such that 


N(s)=N(s)U(s) and D(s)=D(s)U(s) 
only for unimodular U(s), then N(s) and D(s) are said to be right 
coprime. 
” If N (s) and D(s) have the same number of rows, and there exist 
N(s) and D(s) such that 


N(s)=U(s)N(s) and D(s)=U(s)D(s) 
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only for unimodular U(s), then N(s) and D(s) are said to be left 
coprime. 

An MFD G(s)=N(s)D~*(s) (or D~'(s)N(s)) is said to be 
irreducible if N(s) and D(s) are coprime (left or right, as appropriate); 
otherwise it is reducible. 


With the aid of this definition we can state the following theorem: 


Theorem 2.4: If G(s)=N(s)D~ *(s) (or D~ +(s)N(s)), and N (s) and D(s) 
are coprime, then 


(1) z is a (transmission) zero of G(s) if and only if N(s) loses rank at 
S=2Z 


(2) p is a pole of G(s) if and only if D(p) is singular 
Proof: See Kailath (1980), Chapter 6. 


Corollary 2.2: If G(s)=N(s)D~'(s) (or D~*(s)N(s)), and N(s) and 
D(s) are coprime, then the pole polynomial of G(s) is given by 


p(s)= det D(s) (2.40) 


2.5 State-space realization from a transfer-function 
matrix 


The formula (2.3) can be rewritten as 


C adj (sl — A)B 
OF aaa 
Since C adj (sI — A)B is a polynomial matrix, it is clear that every pole of G(s) 
must be a zero of det (sI — A), namely an eigenvalue of A. The converse is not 
true, however, because zeros of det (sI — A) may be cancelled in equation (2.3), 
and hence not appear as poles of G(s). Such cancellations occur precisely 
when the realization (A,B,C, D} is uncontrollable or unobservable. If the 
realization is both controllable and observable, then the poles of G(s) are the 
eigenvalues of A-—or, more precisely, the pole polynomial p(s) is given by 


p(s)= det (sI — A) 


This implies that the dimension of A cannot be smaller than the McMillan 
degree of G(s); for this reason a realization which is both controllable and 
observable is called minimal. 
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We summarize these results in the following theorem: 


Theorem 2.5: 
Let G(s) have a minimal realization (A, B, C, D), and let p(s) be 
the pole polynomial of G(s) (as defined via the Smith-McMillan 
form). Then 


(1) dim A = deg p(s) 


In other words, the McMillan degree of G(s) is the minimal 
dimension of a realization. 

(2) The eigenvalues of A are just the poles of G(s). (Note that if 
(A, B, C, D) is a realization which is not minimal, then the poles 
of G(s) are a subset of the eigenvalues of A.) 


There are several ways of finding a state-space realization which 
corresponds to a given transfer-function matrix. Perhaps the most straight- 
forward is the following, in which a separate realization is found for each 
column of the transfer function -that is, for each input—and these separate 
realizations are then assembled. Let g;(s) be the ith column of G(s), so that 


G(s)= [91(s), G(s), ees gi(s) J 
and let each column be written as 


n,(s) 
d;(s) 


gi(s)= +6; 


where d,(s) is the (monic) common-denominator polynomial of g,(s), 
d(s)=s"+d}st-14+ ... +4 


n,(s) is a vector of polynomials, each of degree smaller than k;, and ô; is a 
vector of constants. Let the jth element of n,(s) be the polynomial 


val) = vis T i vis T A 0. +0 
Now let 
1 0 0 0 
0 0 1 0 0 
A=]: 
0 0 0 0 aa 1 


—di ght gh — dki? sexe =d! 
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0 

ki „kii ... „1 
Vi; Vii Vii 

0 ki yki-1 1 
: Vai Vz: "Way 

B= S , C;= ‘ ; 
0 : : 

ki yki-1 1 
Vmi Vmi Vmi 


Then it is straightforward to show that (A;, B;, C;, ô;) is a realization of g;,(s). 
A realization of G(s) is then given by (A, B, C, D), where 


A=diag {A,,A,...,A)}, B=diag {B,,B,,...,B;}, 


C=[C,,C,,...,C,], D=[6,,63,...,6;] 


This realization is guaranteed to be controllable, but may not be observable. 
For many purposes this does not matter, but if it is desired to obtain a 
minimal realization then the algorithm given in Section 8.3.5 may be used to 
remove any unobservable modes. 

Other methods of finding state-space realizations are given by Kailath 
(1980) and Chen (1984). | 


2.6 How many zeros? 


It is easy to tell at a glance how many zeros a scalar transfer function has, but 
this is not true of transfer-function matrices. Transfer functions of systems 
with unequal numbers of inputs and outputs are not square. If one bears in 
mind that zeros are points in the complex plane at which a transfer function 
loses rank, then it is clear that, for a non-square transfer function to have a 
zero at some point, several of its minors must simultaneously become 
singular at that point. One can construct examples for which this happens, 
but only with carefully chosen coefficients. Generically, non-square transfer 
functions have no zeros. 


EXAMPLE 2.4 


s+2 s+4 


The normal rank of G(s) is 1, and there is no value of s for which both 
elements simultaneously become zero. 


so- a 
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s+1 s+3 
s+2 s+4 
1 s+i 
a) = s+5 s+6 
s+3 1 
s+4 s+2 


The normal rank of G(s) is 2, and there is no value of s for which the second 
column becomes a multiple of the first. 


In order to discuss the number of zeros of a square transfer-function 
matrix, it is useful to introduce the concept of poles and zeros ‘at infinity’. 
First, we examine the concept as applied to scalar transfer functions. Suppose 
that 


If n>m we say that G(s) has n—m zeros at infinity, and if m>n we say it has 
m—n poles at infinity. This allows us to assert that every scalar transfer 
function has as many zeros as poles (if those at infinity are included), and that 
improper (unrealizable) transfer functions are those with poles at infinity. 
Furthermore, the number of zeros at infinity is obviously related to the high- 
frequency gain behaviour of G (jœ) in a simple way, and we can now say that, 
on a root-locus plot, every pole is ‘attracted’ to a zero as the parameter of the 
locus becomes unbounded. 


Definition 2.5: Let H(A)=G(1/A). The poles and zeros of G(s) at 
s=oo are then the poles and zeros of H(A) at 4=0. 


Theorem 2.6: If G(s) is square, it has as many zeros as poles 
(including those at infinity). 


Proof: Let H(A)=G(1/A). Then 
det H(A) =det G(1/A) (2.41) 


__ (t/a) 
kaa) 


(2.42) 
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where z(s) and p(s) are the zero and pole polynomials, respectively, of 


G(s). 
Now let 
z6)=[[(-2) and p()=[] 6-p) 
Then 
2(1/A)=4-™ T] (1 Az) (2.43) 
i=i 
and 
p(t/a)=2-" J] (2p) (2.44) 
Hence 
det H(A) =ka"-™ J] (1—Az) | [] (—ap,) (2.45) 
i=1 i=i 


Now, let H(A) have z,, zeros at 0, and p,, poles at 0. Then, from (2.45), | 
Zo — Pa =n—M (2.46) 


But z,, and p, are the numbers of zeros and poles, respectively, of G(s) 
at infinity. 

Also, let z, and p, be the numbers of finite zeros, poles respect- 
ively, of G(s). Then 


Pp—-2Zp=n—M (2.47) 
Hence 
Pp 25 = 25 — Pw (2.48) 
or 
Pet Po = 2 HZ (2.49) 
a 


Note that this theorem does not hold for non-square transfer functions. 
Now suppose that G(s) is square, of dimensions m x m, and has a 
minimal state-space realization (A, B,C, D), of state dimension n. Then 


G(s)=D+C(sI— A)" +B (2.50) 
which can be expanded as? 


CB CAB CA?B A 


G@=D+—+ > (2.51) 
S S 


s2 
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for |s| large enough, so 
A(A)=D+ACB+A*CAB+/3CA7B+ ... (2.52) 


for |A| small enough. 

From this we see that H(0)=D. Suppose that rank (D)= u. Then the 
rank of H(A) drops from m to u at 1=0, and therefore H(A) has at least m— pu 
zeros at A=0. Thus G(s) has at least m— u zeros at infinity, and therefore at 
most n—m-+ p finite zeros. In the common case when p=m, the rank of G(s) 
does not fall at infinity, and we conclude that G(s) has n finite zeros, and no 
infinite zeros. 

If D =0, and therefore 4 =0, we see that G(s) has at most n—m finite 
zeros, and at least m zeros at infinity. The exact number of finite zeros can be 
established by examining the Markov parameters CA'~'B according to a 
rather complicated algorithm. For our purposes it will be sufficient to note 
that if rank(CB)=m, then there are exactly n—m finite zeros, but that if 
rank (CB) =m —d then there are at most n—m—d finite zeros. 

For a more complete treatment of the topic raised in this section see 
Kailath (1980), and MacFarlane and Karcanias (1976). 

An algorithm for finding zeros is given in Section 8.3.7. 


2.7 Internal stability 


Definition 2.6: A rational transfer-function matrix is exponentially 


stable if and only if it is proper and has no poles in the closed right 
half-plane. 


Clearly, this definition is appropriate for continuous-time systems; 
when considering discrete-time systems we need to replace ‘closed right half- 
plane’ by ‘boundary and exterior of the origin-centred unit circle’, as usual. 

The reason for insisting on properness is that improper systems may 
give unbounded outputs with bounded inputs, even if they have no (finite) 
poles, as shown by the following example: 


EXAMPLE 2.5 


G(s)=s, input: sin(t?), output: 2t cos (t?) 


Consider the feedback loop shown in Figure 2.2, and define transfer functions 
Hi, (s), H,2(s), H2,(s) and H,,(s) such that 


ei(s) |_| Ay,(s) Hy,2(s) || u4(s) 
PAE ai (2.53) 


Note that in Figure 2.2 a positive-feedback convention has been used. 
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Figure 2.2 Feedback configuration for investigation of internal stability. 


Definition 2.7 (see Note 3): The feedback system shown in Figure 2.2 
is internally stable if and only if the transfer-function matrix 


ree 2i 
H,,(s) H32(8) 


is exponentially stable. 


The purpose of this definition is to exclude right half-plane pole-zero 
cancellations taking place between two systems G(s) and K(s), which cannot 
be detected by Nyquist-like tests, since these are applied to the return ratio 
G(s)K(s) (or K(s)G(s)). The definition states, in effect, that each of the four 
transfer functions H,;(s) must be exponentially stable. 

If K(s) and G(s) are both unstable, then it is necessary to check all four 
of these transfer functions to ensure internal stability. However, if at least one 
of them is stable (as is usually the case) then there is less work to do: 


Theorem 2.7: If K(s) is exponentially stable, then the feedback 
system shown in Figure 2.2 is internally stable if and only if 
H,,(s)=[1 —G(s)K(s)]~'G(s) is exponentially stable. 


Proof: ‘Only if’: Immediate. 


‘Tf’: 
I+K(s)H,,(s)=1+K(s)LI—G(s)K(s)] 'G(s) (2.54) 
=1I+K(s)G(s)[I —K(s)G(s)]* (2.55) 
= [I — K(s)G(s)]"! (2.56) 
=H, ,(s) (2.57) 


So if K(s) and H,,(s) are exponentially stable, then so is H 11(s). But 
this also implies that H,,(s)K(s)=H,2(s) is exponentially stable. 
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Finally 
I+ H,(s)K(s) = 1+[—G(s)K(s)]~ G(s) K(s) (2.58) 
= [I—G(s)K(s)]7! (2.59) 
= H,,(s) (2.60) 


which shows H,,(s) to be exponentially stable. 
Hence the feedback system is exponentially stable. E 


We can move towards a Nyquist-like criterion via the following: 
Theorem 2.8: If K(s) is exponentially stable, then H,,(s) is expo- 
nentially stable if and only if 


(1) det [J — G(s)K(s)] has no zeros in the closed right half-plane 
(including infinity), and 


(2) [I —-G(s)K(s)]~*G(s) is analytic (i.e. has no poles) at every 
closed right half-plane pole of G(s) (including infinity) 


Proof: In the following proof, ‘closed right half-plane’ is abbreviated 
as CRHP. 
‘Only if”: 


. (1) Let G(s)K(s)=N(s)D~ 1(s) be a coprime MFD. Then 
H,,(s)=[I—G(s)K(s)]~* G(s) (2.61) 
= D(s)[D(s)— N(s)]~*G(s) (2.62) 


so H,(s) exponentially stable = D(s)— N (s) has no zeros in 
CRHP since D(s)—N(s) can have no zeros in common with 
N(s) or D(s) (by coprimeness) and every CRHP zero of G(s) is 
also a zero of N(s), because of the stability of K(s). 

Hence, H,,(s) exponentially stable 


=> det [D(s)— N(s)]40 in CRHP 

=> det [J — N(s)D~+(s)] 40 in CRHP. 
(2) Immediate. 
‘Tf’: 


det LJ — G(s)K(s)] #0 in CRHP 
=> any CRHP poles of H,,(s) must be CRHP poles of G(s)(*) 
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But H,,(s) is analytic at CRHP poles of G(s) 

=no CRHP poles of H,,(s) coincide with CRHP poles of 
G(s) 

=> H,,(s) has no CRHP poles by (+). a 


Note that part (2) of this theorem is needed to pick up pole-zero can- 
cellations, and that it is needed even with SISO systems: 


EXAMPLE 2.6 
s—l 1 
KJS T G) = $ 
det [1 — G(s) K(s)] e (looks OK) 
but 
U-GKO)I'60) = 


Of course, each of the last two theorems can be restated for expo- 
nentially stable G(s). 

Theorems 2.7 and 2.8 together give the basis for a generalization of the 
Nyquist stability criterion, which will be developed in the next section. 

If we revert to a negative-feedback convention, instead of the positive 
feedback used in Figure 2.2, we can obtain all the corresponding results 
simply by changing the sign of K(s) (or G(s)). So in Theorems 2.7 and 2.8 we 
merely have to replace [I — GK] by [I+ GK], and we obtain the results for 
negative-feedback loops. 

The conditions which need to be satisfied in order to achieve internal 
stability apply whenever two systems are connected together in a feedback 
loop, and do not depend on the position of those systems in the loop. In other 
words, they apply not only when one system is in the forward path and the 
other in the feedback path, as shown in Figure 2.2, but also to the con- 
ventional configuration when both plant and compensator are in the forward 
path (as in Figure 1.8). Why, then, has feedback design been successfully 
based on the Nyquist stability theorem, which checks, in effect, only part 1 of 
Theorem 2.8? The reason is that all the additional complexity which has 
appeared in this section has been concerned with detecting the presence of 
right half-plane pole-zero cancellations, and thus of hidden unstable modes. 
When designing a compensator ‘by hand’, using the traditional techniques, 
one has complete and explicit control over the location of poles and zeros. 
Thus, as long as the designer obeys the basic injunction: ‘do not introduce 
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right half-plane pole-zero cancellations’, it is enough to use only part 1 of 
Theorem 2.8. In this case the system G(s) usually represents the series 
connection of plant and compensator, while K(s) usually represents a con- 
stant gain, which is therefore exponentially stable. 

This situation continues to hold when one uses the semi-automatic 
methods for multivariable design which we shall introduce in Chapter 4. But 
when one adopts the fully automated synthesis techniques described in 
Chapters 5 to 7, one needs to be sure that unstable hidden modes have not 
been introduced by the computer. In the linear quadratic Gaussian and H,, 
techniques examined in Chapters 5 and 6, the theory of internal stability is 
built into the synthesis theory, so that these techniques produce only inter- 
nally stable designs. But the parameter-optimization techniques described in 
Chapter 7 offer no such guarantee, and either the internal stability of the 
resulting designs needs to be checked, or conditions such as those of Theorem 
2.8 need to be added as constraints during the optimization. 


2.8 The generalized Nyquist stability criterion 


Let G(s) be a square, rational transfer matrix. As we said in the previous 
section, this G(s) will often represent the series connection of a plant with a 
compensator, and we must assume that there are no hidden unstable modes 
in the system represented by this transfer function. We wish to examine the 
stability of the negative-feedback loop created by inserting the compensator 
kI into the loop (i.e. an equal gain k into each loop), for various real values of 
k. To do this we apply part 1 of Theorem 2.8, with K(s)=—KkI. 

Let det [I + kG(s)] have Po poles and P, zeros in the closed right half- 
plane. Then, just as with SISO systems, we have, by the principle of the 
argument, that 


Aarg det [J + kG(s)] = —2x(P, ~ Po) (2.63) 


where Aarg denotes the change in the argument as s traverses the usual 
Nyquist contour once—up the imaginary axis from the origin to some 
arbitrarily large distance, then along a semicircular arc in the right half-plane 
until it meets the negative imaginary axis, and finally up towards the origin; if 
any poles of G(s) are encountered on the imaginary axis the contour is 
indented so as to include these poles. But for closed-loop stability we need P, 
to equal zero, and the poles of det [I + kG(s)] are just the poles of G(s). Hence 
we see that the condition for closed-loop stability is that the image of 
det [I +kG(s)] (as s goes once round the Nyquist contour) encircle the 
origin Po times anticlockwise. 

Note that the argument so far has exactly paralleled that for SISO 
feedback. However, if we stopped here we would have to draw the Nyquist 
locus of det [1+kG(s)] for each value of k in which we were interested, 
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whereas the great virtue of the classical Nyquist criterion is that we draw a 
locus only once, and can then infer stability properties for all values of k. 

It is easy to show that, if /,(s) is an eigenvalue of G(s), then k4,(s) is an 
eigenvalue of kG(s), and 1+ k4;(s) is an eigenvalue of J + kG(s). Consequently 
(since the determinant is the product of the eigenvalues) we have 


det [1+ kG(s)]= I] [1+kA,(s)] (2.64) 
and hence 
Aarg det [I+ kG(s)] =) Aarg[1+kA,(s)] (2.65) 


so we see that we can infer closed-loop stability by counting the total number 
of encirclements of the origin made by the graphs of 1 +k4;(s), or equival- 
ently, by counting the total number of encirclements of —1 made by the 
graphs of kA,(s). In this form, the criterion has the same powerful character as 
the classical criterion, since it is enough to draw the graphs once, usually 
for k=1. 

The graphs of /,(s) (as s goes once round the Nyquist contour) are 
called the characteristic loci. 

The above argument is not complete. Is it really true that the 
encirclements which we are supposed to count are well defined? There is a 
potential problem, as we can see if we consider the transfer-function matrix 


0 1 
G(s)=| 5—1 r (2.66) 
stl 


This matrix has eigenvalues /,(s)= +./[(s—1)/(st+1)]. Neither 4,(j@) nor 
4,(j@) forms a closed curve, so we see that we may have problems counting 
encirclements. This problem arises because the eigenvalues of a rational 
matrix are usually irrational functions. 

It turns out that in practice there is no problem, but this is not easy to 
prove. We shall outline the idea of a proof. First we note that eigenvalues are 
continuous functions of the elements of a matrix. Consequently, if G(s) has 
dimensions m x m, we can distinguish m eigenvalue functions A;(ja), each of 
which is continuous in @ (but the indexing is arbitrary, of course). Further- 
more, eigenvalues of G (jæ) coincide at a finite number of values of œ, at most. 
So the graphs of A;(j@) are indeed m ‘well-behaved’ graphs, with (possibly) 
isolated points of intersection. 

Also, if both characteristic loci are drawn for the above example 
(Figure 2.3), it is seen that together they form a closed curve, since 
1,(j)=Az(—joo) and A,(—joo)=4,(joo). This is always the case, basically 
because G(j0o)=G(—j) (and hence the sets of eigenvalues at joo and —joo 
are the same). So, provided we consider all the characteristic loci together, we 
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Figure 2.3 The two characteristic loci for the system defined by equation (2.66). 


get a closed curve (or possibly a set of distinct closed curves) and can 
therefore count the encirclements of any point by this curve unambiguously. 


Theorem 2.9 (Generalized Nyquist theorem*): If G(s) (as defined 


above) has Po unstable (Smith-McMillan) poles, then the closed-lo 


op 


system with return ratio —kG(s) is stable if and only if the character- 
istic loci of kG(s), taken together, encircle the point —1P, times 


anticlockwise, assuming that there are no hidden unstable modes. 
EXAMPLE 2.7 


1 s—1 S 
Ge a -6 2 S 


The characteristic loci of this transfer function are shown in Figure 2.4. 
Since G(s) has no unstable poles, we will have closed-loop stability if these 
loci give zero net encirclements of —1/k when a negative feedback kI is 
applied. From Figure 2.4 it can be seen that for — oœ < — 1/k < —0.8, for 
—0.4< —1/k<0 and for 0.53 < —1/k<oo there will be no encirclements, 
and hence closed-loop stability will be obtained. For —0.8< —1 /k< —0.4 
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Figure 2.4 The two characteristic loci for the system defined by 
equation (2.67). 


there is one anticlockwise encirclement, and hence closed-loop instability, 
while for 0< —1/k<0.53 there are two clockwise encirclements, and 
therefore closed-loop instability again. 


2.9 The generalized inverse Nyquist stability criterion 


It is possible to restate the generalized Nyquist stability criterion in terms of 
the characteristic loci of the inverse return ratio G~ *(s) rather than G(s): 
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Theorem 2.10 (Generalized inverse Nyquist theorem°): If G(s) (as 
defined in Section 2.8) has Zo (transmission) zeros in the closed right 
half-plane, then the closed-loop system is stable if and only if the 
characteristic loci of G~‘(s)/k, taken together, encircle the point 
— 1 Zo times anticlockwise, assuming that there are no hidden 
unstable modes. 


Proof: Since no pole-zero cancellations can occur between G(s) and 
kl, it is sufficient to check part 1 of Theorem 2.8, with F(s)= — kl. 


Now, 
I+kG(s)=[G~ \(s)/k+1]kG(s) (2.68) 
SO 
det [J +kG(s)] =k” det [G~ +(s)/k + I] det G(s) (2.69) 
=k" det [G+ (s)/k +I] as) (2.70) 
p(s) 


where z(s) and p(s) are the zero and pole polynomials, respectively, 
of G(s). 

Now let z,(s) and p,(s) be the zero and pole polynomials of 
I +kG(s). Then p,(s)= p(s), since the poles of J + kKG(s) are just those of 


G(s). Hence 

z,(s)=k" det [G7 1(s)/k + I] 2(s) (2.71) 
or 

k” det [G—3(s)/k+ 1] = 20 (2.72) 


z(s) 
So if z,(s) has Z, roots in the CRHP, and z(s) has Zo roots there, then 
Aarg det [G1 (s)/k +1] = —22(Z,—Zo) (2.73) 
But for stability we need Z,=0-—that is, 


Aarg det [G7 + (s)/k+I]=21Zo (2.74) 


The restatement in terms of encirclements by the characteristic loci of 
G~1(s)/k follows as in Theorem 2.9. a 


Note that if {/;(s)} is the set of eigenvalues of G(s), then {4;7 '(s)} is the set of 
eigenvalues of G~ 1(s). 
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If the inverse Nyquist criterion is used, then the behaviour of the 
inverse characteristic loci on the semicircular portion of the Nyquist contour 
must be examined carefully. The inverse loci usually become large as s 
becomes large, so encirclements may be introduced as s moves around this 
part of the contour. 


2.10 Nyquist arrays and Gershgorin bands 


The Nyquist array of G(s) is an array of graphs (not necessarily square), the 

(i, j)th graph being the Nyquist locus of g,;(s). Use is also made of the inverse 

Nyquist array, which is the array of graphs of Nyquist loci of the elements of 

G~ t(s). (Of course, the inverse array is defined only when G(s) is square.) 
The key to Nyquist array methods is: 


Theorem 2.11 (Gershgorin’s theorem): Let Z be a complex matrix of 
dimensions m x m. Then the eigenvalues of Z lie in the union of the m 
circles, each with centre z; and radius 


m 
È |Zil i=l,. <., M 
j=1 
j#i 


They also lie in the union of the circles, each with centre z; and radius 


m 
y [Zh i=1,...,m 
j=1 


j#i 
Proof: See Kreyszig (1972) or Rosenbrock (1974). a 


Consider the Nyquist array of some square G(s). On the loci of g;;(ja) 
superimpose, at each point, a circle of radius 


X lgij(J@)| or X Iga (jo) 
j=l j=l 

j#i j#i 
(making the same choice for all the diagonal elements at each frequency). The 
‘bands’ obtained in this way are called Gershgorin bands; each is composed of 
Gershgorin circles. Gershgorin bands are illustrated in Figure 2.5. 

By Gershgorin’s theorem we know that the union of the Gershgorin 
bands ‘traps’ the union of the characteristic loci. Furthermore, it can be 
shown that if the Gershgorin bands occupy distinct regions, then as many 
characteristic loci are trapped in a region as the number of Gershgorin bands 
occupying it. So, if all the Gershgorin bands exclude the point —1, then we 
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(1,2) element 


Figure 2.5 A Nyquist array, with Gershgorin bands. 


can assess closed-loop stability by counting the encirclements of — 1 by the 
Gershgorin bands, since this tells us the number of encirclements made by the 
characteristic loci.® 

If the Gershgorin bands of G(s) exclude the origin, then we say that 
G(s) is diagonally dominant (row dominant or column dominant, if appli- 
cable). Note that to assess stability we need [J+G(s)] to be diagonally 
dominant. 

The greater the degree of dominance (of G(s) or of J+ G(s)) — that is, 
the narrower the Gershgorin bands- the more closely does G(s) resemble m 
non-interacting SISO transfer functions. 

Clearly, from the generalized inverse Nyquist theorem we can obtain a 
stability criterion in terms of the Gershgorin bands of G~1(s): if all the 
Gershgorin bands of G~ '(s) exclude the point — 1, we can assess closed-loop 
stability by counting the number of encirclements of —1 made by these 
bands. In this case we require 1+G~‘(s) to be diagonally dominant on the 
Nyquist contour (including the semicircular part of the contour). 

These Nyquist-array-based tests check sufficient, but not necessary, 
conditions for stability (or instability). If any Gershgorin band does overlap 
the point — 1, in other words if I +G (or 1+ G~') is not diagonally dominant, 
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Figure 2.6 Feedback loop containing the system defined by equation (2.75). 


then we cannot infer whether the system is stable or not. But suppose that we 
replace G by 


G=XGXx"! (2.75) 


This is a similarity transformation, so the characteristic loci of G must be 
the same as those of G, and hence we can check stability by displaying 
Gershgorin bands of G (or G~') just as well as we can with those of G. 
Physically, equation (2.75) corresponds to inserting a system X at the output 
of G, and X~! at the input of G, as shown in Figure 2.6. The point of doing 
this is that G may be diagonally dominant, even if G is not, for some choices 
of X. 

A method for finding a suitable X has been given by Mees (1981). For 
any matrix M with elements m;;, we define 


abs M =[|m;;|] (2.76) 


that is, the elements of abs M are |m;;]. Such a matrix is called positive. If M is 
square, let the eigenvalues of abs M be {/,,/,,...,4,}, and let them be 
ordered so that |A,| >|4,| >... . M is called primitive if (abs M)" has positive 
entries only for some integer r. For primitive positive matrices A, is real and 
A, >|A,| for i#1; 2, is called the Perron—Frobenius eigenvalue of M, and we 
shall denote it by Ap(M). The corresponding left and right eigenvectors of 
abs M are called the Perron—Frobenius eigenvectors of M (and are also real 
and positive). 
Now, let 


M giag = diag {m; ;, M225- . , Mnn} (2.77) 


Mees proves the following theorem: 
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Theorem 2.12 (Mees, 1981): If G is square and primitive, then there 
exists a diagonal matrix X such that 


G=XGXx~"! 


is diagonally dominant, if and only if 


Ap(GGgiag) < 2 (2.78) 

If (2.78) is satisfied, and the Perron—Frobenius left eigenvector of GG4jJ, is 
(x1,X2,...,X,)', then an X which achieves diagonal dominance is 

X = diag {x1,%2,..-,X,} (2.79) 


Note that the X found by this theorem can be interpreted as a scaling 
of the outputs of G, and X~* as a scaling of the inputs of G, since X is 
diagonal and real. A different X is required at each frequency, of course; this 
is no problem if we are using X only for analysis, since X does not correspond 
to a system that has to be built. Later we shall use this theory for design, and 
we shall then have to build systems whose gain behaviour approximates that 
of X as frequency varies. 

Theorem 2.12 requires G to be primitive, but in practice this is not a 
severe restriction. If all the elements of G are non-zero then G is certainly 
primitive. A problem occurs only if some elements of G are fixed at zero as 
frequency varies. If this happens then G is usually block-diagonal, or at least 
block-triangular, or can be made so by reordering inputs or outputs. The 
characteristic loci of G are then just the characteristic loci of the (primitive) 
blocks that occur on the principal diagonal of G, and Theorem 2.12 can be 
applied separately to each of these blocks. 

Mees also shows that the X produced by Theorem 2.12 is optimal, in 
the sense that it gives the narrowest possible Gershgorin bands obtainable 
with any diagonal similarity transformation. 

Note that Theorem 2.12 should be applied to 1+G, rather than 
G, if we do not want the Gershgorin bands to cover the point —1. Since 
X (1+ G)X~*=1+XGX~!, we can still infer stability from the Gershgorin 
bands of XGX~?. 

There is one form of stability theorem which is expressed in terms of 
Gershgorin bands but which we have not presented yet, and which can be 
very useful for checking the robustness of particular designs as gains or other 
real parameters vary, as we shall see in Chapter 3. This form is relevant when 
the feedback loop contains a square transfer function G(s) connected in series 
with a diagonal gain matrix 


K =diag {k,, kp,... Kp} (2.80) 
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and each k; is a constant (usually real). The difference between this and the 
feedback loops considered so far in Sections 2.8 and 2.9, and this section, is 
that the gain (k;) in each loop may now be different. 


Theorem 2.13 (Rosenbrock, 1970): Suppose that G(s) is square, that 
K =diag {k,,...,k,} and that 


1 
g+ > DC MO] (2.81) 
i j=+*i 


for each i and for all s on the Nyquist contour; and let the ith 
Gershgorin band of G(s) encircle the point — 1/k;, N; times anticlock- 
wise. Then the negative feedback system with return ratio — G(s)K is 
stable if and only if 


LNi=Po (2.82) 


where Po is the number of unstable poles of G(s), and there are no 
hidden unstable modes. 


Proof: We know from Section 2.8 that a necessary and sufficient 
condition for stability is that 


N=Po5 
where N is the number of anticlockwise encirclements of the origin 


made by det [I +G/(s)K], as s moves once around the Nyquist con- 
tour. So, to prove the theorem we need to prove that 


N=} N; (2.83) 
To do this, note that the ith Gershgorin band of G(s)+ K ~* encircles 
the origin N; times anticlockwise (the condition (2.81) ensuring that it 
does not cover the origin). But the union of these bands contains the 


characteristic loci of G(s)+K~+; call these characteristic loci 4;(s) 
(i=1,...,m). So we have, as in Section 2.8, that 


Aarg det [G(s)+K~*] =} Aarg TAC) (2.84) 


=P N, (2.85) 
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But 
Aarg det [I + G(s) K ] =Aarg det [G(s)+ K~1]+Aarg det(K) 
(2.86) 
= Aarg det [G(s)+ K~*] (2.87) 
Hence 
N=N, a 


i 


Theorem 2.13 can clearly be restated in terms of column dominance 
instead of row dominance, by replacing (2.81) by 


1 
uoti 


T|? È lgx) (2.88) 
i j+i 


and indeed the theorem holds if either (2.81) or (2.88) holds at each point on 
the Nyquist contour. A corresponding theorem can also be stated in terms of 
the inverse transfer function G~ ‘(s) (see Exercise 2.11). 


2.11 Generalized stability 


So far we have used the term ‘stability’ in its familiar and usual sense, and 
have tested for its existence by checking whether any poles lie in the right 


Figure 2.7 Generalized stability: closed-loop poles may not lie in the hatched area 
(for example). 
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half-plane. We can obviously transfer all the results to discrete-time systems 
by replacing the left half-plane by the interior of the unit circle. But we can 
also replace the left half-plane by any other region around which we can draw 
a single, closed boundary. (If the left half-plane does not appear to meet this 
requirement, consider the points +joo and —joo to be the same.) 

Suppose, for example, that we require of a system not merely that it be 
stable, but also that all its poles and hidden modes decay at least as fast as 
e~ 4, and to have damping factors no smaller than 1/,/2; in other words, they 
should all be located to the left of the hatched region shown in Figure 2.7. To 
check whether this condition is satisfied, we can use all the results of Sections 
2.7 to 2.10, replacing ‘right half-plane’ by the hatched region shown in Figure 
2.7, and ‘imaginary axis’ by the boundary of this region, as shown by the 
arrows in Figure 2.7. 

In this way we can generalize the notion of stability without needing 
any new theory. 


SUMMARY 


In this chapter we have introduced transfer function matrices as des- 
criptions of multivariable, linear dynamic systems, and related them to 
state-space models. Matrix fraction descriptions were also introduced as 
alternative representations which are often useful for resolving theoretical 
questions. 

We have generalized the notions of poles and zeros to multivariable 
systems by means of the Smith—McMillan form of a transfer function matrix. 
The poles determine system stability, just as is the case for SISO systems; in 
the next chapter we shall see that the zeros may limit the achievable 
performance of multivariable feedback systems. 

The stability of feedback systems has been considered. The notion of 
‘internal stability’ was introduced, and a generalization of Nyquist’s stability 
theorem to multivariable systems was developed. This theorem was stated 
in terms of characteristic loci, namely graphs of the eigenvalue functions of 
a multivariable return ratio. Sufficient, but not necessary, conditions for 
closed-loop stability or instability have been given, stated in terms of 
Gershgorin bands, along with means of reducing their conservativeness. 


Notes 


1. The definition of the poles and zeros by means of the Smith-McMillan form is 
taken from Rosenbrock (1970). Most of Sections 2.2 to 2.5 can also be traced 
back to Rosenbrock’s work. 

2 This is the Laurent series expansion of G(s) about s = oo. 
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The definition of internal stability, and much of the rest of Section 2.7, follows 
Desoer and Chan (1975). 


This theorem was first conjectured by MacFarlane in about 1970. It has been 
proved several times, in various degrees of generality and correctness, and by 
several different approaches. The argument outlined in the text follows Desoer 
and Wang (1980). This paper also proves the theorem for the case when G(s) 
is irrational, and relies heavily on the use of graph theory. A very different 
approach, using the theories of Riemann surfaces and algebraic functions, 
is taken by MacFarlane and Postlethwaite (1977), Postlethwaite and 
MacFarlane (1979), and Smith (1984). 


This statement of the generalized inverse Nyquist theorem is taken from 
Postlethwaite (1977). 


Stability theorems phrased in terms of Gershgorin bands were first developed 
by Rosenbrock (1970). His proofs make no use of characteristic loci. 


EXERCISES 


2.1 


2.2 


2.3 


2.4 


2.5 


mm 


If (A, B,C, D) is a realization of G(s), and A=T~'AT, B=T™~'B and 
C=CT, verify that (A, B, C, D) is also a realization of G(s). Also show 
that G(oo)=D. 
If the discrete-time state-space model 

Xk+1 = Ax; + Bu, Y = Cx, + Du, 


has transfer function G(z), show that G(z) is related to the realization 
(A, B, C, D) by the same formula that holds for continuous-time sys- 
tems. (As a consequence of this result, everything in this chapter holds 
for discrete-time systems.) 


Show how the elementary operations on polynomial matrices can be 
represented by multiplication with suitable matrices (the elementary 
matrices). Verify that such matrices are unimodular. 


Find the Smith-McMillan form of 


1 | 1 0 
(s+1)(s+3)| —1 rd 


Hence find its poles and zeros. 


Find MFDs of 
(a) 
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2.6 


2.7 


2.8 


2.9 


2.10 
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(b) 
1 -1 


s? +s—4 2s?—s-—8 
s?—-4 25-8 


Hence find their poles and zeros. 


1 
(s+ 1)(s+2) 


By considering a coprime MFD of G(s), show that the poles of 
det [I +kG(s)] are the same as the poles of G(s), if k is a scalar 
constant. 


Show that the zeros of a transfer function in the forward path of a 
feedback loop remain as zeros of the closed-loop transfer function, 
unless they are deliberately cancelled. Show by giving an example that 
this is not true of the zeros of the elements of the transfer function. 


(a) Find a state-space realization of 


+5 ı 


s*4+2s+3 stl 


G(s)= 
s+4 s+2 


Check the minimality of your realization (preferably using 
suitable software), and hence find the McMillan degree of G(s). 
What are the poles of G(s)? 


(Solution. McMillan degree=5, poles = {—1, —2, —4, 
—1+j,/2}) 
(b) Repeat (a), but with the (1, 1) element of G(s) replaced by 
(s+ 5)/(s? + 3s +2). 
(Solution: McMillan degree =4, poles={ —1, —2, —2, — 4}) 
If G(s) is square and not identically singular, show that the zeros and 


poles of G~ + (s), including those at infinity, are just the poles and zeros, 
respectively, of G(s). 


(a) Show that the transfer functions H,,, H,2, H2, and H22, 
which appear in equation (2.53), are given in terms of K and G 
(which appear in Figure 2.2) by 

H,,=(1-KG)"*, H,.=(I-KG)"'K 
H,,=(1—GK)~'G, H,,=(I-GK)"' 


2.11 


2.12 
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(b) Hence show that 


Ay, H J| -K rę 
H,, H,, J —G I 
(Hint: X [I+ YX] '=[1+XY]- X) 


(c) Deduce that a necessary condition for internal stability is 
that det[I—GK] has no zeros in the CRHP. Why is it not 
sufficient? 


W X 
(Hint: det | YZ | =det Wdet[Z—YW"'X], known as 


Schur’s formula) 


Let G(s) be a square transfer function, let ĝi; be the (i, j) element of the 
inverse G(s)=G~ '(s), and K=diag {k,,k,,..., km}. Suppose that 


Jduls)+kil > D lau) 


for each i and for all s on the Nyquist contour. Let the ith Gershgorin 
band of G(s) encircle the point —k,, N ; times anticlockwise. Show that 
the negative feedback system with return ratio — G(s)K is stable if and 
only if 


XN i=Zo 
if G(s) has Zp transmission zeros in the right half-plane. 


(This exercise is based on Section 3.5 of Owens (1978).) Suppose that a 
strictly proper system with transfer function G(s) has m inputs, out- 
puts and states, and a minimal state-space realization (A, B,C, 0. 
and that B~! and C™! exist. Show that 


im) 


G~'(s)=sAg+A, 


and find the constant matrices A, and A, in terms of A,B and C. 
Where are the zeros of G(s) located? : 
Suppose that a compensator of the form 


haa, 


is used, where a j and f; are real parameters (j=1,..., m). By writing 
I+GK=G(G"!+K), show that the closed loop is stable for any 
positive values of a; and £,, and that all the closed-loop poles are real. 


K(6)= A diag fa; +8 +É. 
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(Note that this holds even if G(s) is unstable. Owens calls G(s) a 
multivariable first-order system; process plant may sometimes be 
modelled by such systems. The compensator K (s) has a fixed struc- 
ture, determined by Ay and A,, and parameters a; and £8; which can be 
tuned (possibly on-line) to give acceptable dynamic characteristics.) 


a a 
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3.1 Introduction 


The main purpose of using feedback is to reduce the effects of uncertainty. 
Some uncertainty is always present, both in the environment of the 
system-we do not know in advance exactly what disturbance and noise 
signals the system will be subjected to, and in the behaviour of the system 
itself— we know our models are not perfect, and the behaviour of the system 
may change in unpredictable ways. A further reason for using feedback is to 
stabilize an unstable system. 
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We shall to some extent separate out the assessment of how well a 
feedback system deals with unwanted or unexpected signals—its per- 
formance—from the assessment of how resilient it is in the face of internal 
changes in behaviour — its robustness. But we shall see that the same tools are 
used to evaluate these two aspects of a feedback system’s behaviour, 
and towards the end of the chapter we shall formulate a design problem in 
which performance goals and robustness goals become mathematically 
indistinguishable. 

A feedback system’s ability to follow a reference input is usually 
considered to be an important part of its performance. This can also be 
assessed in the same way as the system’s ability to recover from external 
disturbances, but it should be recognized that this is not, strictly speaking, a 
part of feedback design. As we saw in Chapter 1, the transfer function between 
a reference input and a system output can be modified by use of a pre-filter 
which is outside the feedback loop. We can therefore picture a two-stage 
design procedure in which the feedback loop is designed first, in order to 
reduce the effects of uncertainty, and the overall closed-loop transfer function 
is then adjusted by inserting a pre-filter, which requires only open-loop 
design. This distinction becomes blurred, however, when no pre-filter is to be 
included—either because there is no reference input, as in a regulator, or 
because practical considerations exclude it. In this case the response to a 
reference signal is essentially the same as the response to a disturbance at the 
plant output, provided there are no dynamics in the feedback path. The 
distinction is also blurred when an optimization technique is used which 
designs both the feedback compensator and the pre-filter at the same time. 


3.2 Principal gains (singular values) 


In a SISO system the performance of a feedback loop is determined by the 
variation of the loop gain with frequency; disturbance rejection, noise trans- 
mission and differential sensitivity to parameter variations all depend only 
on the gain (assuming that stability is achieved). If the open-loop transfer 
function (return ratio) has no right half-plane zeros, then stability margins 
and the closed-loop transient response are also determined by the open-loop 
gain characteristic. 

In attempting to extend this correlation to multivariable feedback, the 
main problem is that a matrix does not have a unique gain: the norm 
|| G(s)u(s)|| depends on the direction of the vector u(s). However, we can 
bound the ratios 


IGu) 1G- 
or t DOl 
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using matrix norms. (We shall usually assume that G(s) is square and 
invertible, but neither of these assumptions is necessary.) Thus we are led to 
replace the idea of a single gain by the notion of a range of gains, this range 
being bounded below and above. 

If ||x|| denotes any vector norm, then an induced (or subordinate) 
matrix norm is defined by 


|| Gx || 
G || = sup -—— 
reo xl 


(3.1) 


In particular, if we use the Euclidean vector norm (for complex 
vectors) 


Ix |] =x) (3.2) 
then the induced matrix norm is the Hilbert or spectral norm: 
IG = 


where ē° is the maximum eigenvalue of GĦG (or of GG"). Here x# denotes xT, 
and similarly for GĦ. Now, if G has m rows and | columns, and m >, then the 
positive square roots of the eigenvalues of GĦG are called the singular values 
of G. (If m<I then the square roots of the eigenvalues of GG# are the singular 
values of G.) 

If instead of G we have G(s), and set s= jo (0<@<oo), then the 
singular values of G(jæœ) are functions of œ, and they are then called the 
principal gains’ of G(s). We shall denote them by {o,(@)} when we wish to 
emphasize their dependence on frequency, or by {o,(G)} when we wish to 
distinguish the principal gains of G from those of some other system. We shall 
also adopt the ordering o,;>0,>... 2Om, When necessary, and denote 
Oi, Om by & and g respectively, when we wish to emphasize the use of the 
largest or smallest principal gain. 

Note that 


G(G(jo))= || G(jo) |, (3.3) 


but that this is a norm on the matrix G(jw), which changes with œ. Later (in 
Section 3.8) we shall introduce ||G||, and |G||,., norms on the transfer 
function G that are independent of frequency. 


The singular-value decomposition 


Let X=diag {0,, 05, oe Omp and let G be a complex matrix. Then, G can 
always be written as ` 


G= YEU" (3.4) 
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where 
ifmel: ifm<l: 
Yec”*! Yer’ 
S eR! Sere 
UH ec!” UHec™”*! 
Yoy=i, yy Yy"=],, 


U¥U=UU"=I,  U#U=I, 


and C™*! denotes the set of complex matrices with m rows and I columns, and 
R™*! denotes the set of real matrices with these dimensions. 

This is known as the singular-value decomposition. For further details 
see Golub and van Loan (1983) or Stewart (1973). Note that this decomposi- 
tion is not unique: G= Y’ZU"", where Y'= Yei? and U’=Ue 3%, for any 9, 
also gives a singular-value decomposition. However, the c; are unique. 

We shall now derive some properties of the singular-value decom- 
position. 


GGH=(YrU#)( YrUs}F 
=(YZU#\(ULZY*) 
= Yr? y" (3.5) 
If m<I then Y¥=Y~', which shows that in this case Y is the matrix of 
eigenvectors of GG", and {a7} are its eigenvalues. 

GHG =(ULY#)\( YrU") 

=Uzr?2U# (3.6) 
If m>/ then U#=U ~', so in this case U is the matrix of eigenvectors of GHG, 
and {o?} are its eigenvalues. 

Note that rank(G)=rank(Z), since U and Y are unitary. So if 
rank(G)=r, then only the first r singular values of G are positive; the 
remainder are zero. 

Consider 

H=Uz'¥Y# (3.7) 
Then 

HGH=(US~!Y*(YruU*\(Ux"* Y*) 

=H (3.8) 
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and 
GHG=(YZXU#\(UzZ-!Y#)\( YIU) 
=G (3.9) 
which shows that H is a pseudo-inverse of G. Thus we have 
GaUs-*y# (3.10) 


where G denotes the pseudo-inverse of G. We have assumed here that 
rank(G)=min(I, m). If r<min (l, m) then instead of £~! we take 


zo? 0 
0 0 
where 2, =diag {o,,...,0,}. 
If G is square and non-singular, then 


G-1=Ux71Y# (3.11) 
from which 
1G" (joo) ||, =—— (3.12) 
BONE oan) i 


Hence we have that 


KOOI. 


| y(ja) I ` g(o) 
and 
lajol E sio 
lujo ` 


Putting y(j@)=G(jw)u(jo), we obtain 


| G(ja)u(jo) | 


| - 
a(w) < ua) | < lw) (3.13) 


which shows that the gain of a multivariable system is sandwiched between 
the smallest and largest principal gains. 

With each principal gain we can associate a pair of principal directions, 
as follows. Assume that m>I, and let the rows of Y" be y4, y#,...,y4, and 
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the columns of U be u,,u2,...,u,. Then by using (3.4) we can write 
y=Gu (3.14) 
l 
= J. ony, ugu (3.15) 
k=1 


Since || u;,|| =1, we have 
Jugu] < llul lull =e (3.16) 


the equality holding only if u=au, for some scalar «. Suppose, then, that 
u=au,, with |a|=1, so that ||u||=1. Then uřu=0 for ki, and hence the 
resulting output is 


Yy =0;y:% (3.17) 
so that 
ly ll =c; (3.18) 


This shows that the gain of the system is precisely o; if the input signal is in the 
direction of u;. The set {u,,u2,...,%} is called the set of input principal 
directions of G. In particular, the greatest possible gain ¢=0, occurs if the 
input signal is in the direction of u,, and the smallest possible gain =o, 
occurs if it is in the direction of u. Note that the principal directions are 
orthogonal to each other (that is, uj'u;=0 if i#j), since U#U=I. 

If the input vector is in the direction u,, then (3.17) shows that the 
output vector is in the direction y;. The set {y1, Y2,-. -» Yı} is called the set of 
output principal directions. Again, these are orthogonal to each other. 

A useful characteristic of a system is its condition number, which is 
defined as 


cond (G)= on (3.19) 


and which depends, of course, on frequency. In numerical analysis the 
condition number measures the difficulty of inverting a matrix (see 
Section 8.2.2). It has been argued that it has a control-theoretic significance, 
in that it measures the inherent difficulty of controlling a given plant. We 
shall examine this point later. 
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3.3 The use of principal gains for assessing 
performance 


From Chapter 1, we have, for the system shown in Figure 3.1, the following 
fundamental relations: 


y(s)=S(s)d(s) + LI — S(s)] P(s)r(s)— [I —S(s) ]m(s) (3.20) 
where 

S(s)=[I+G(s)K(s)]~* (3.21) 
is the sensitivity, or alternatively 

y(s)=LI — T(s)]d(s) + T(s)P(s)r(s)— T(s) m(s) (3.22) 


where 


T(s)=S(s)G(s)K(s) | (3.23) 


is the closed-loop transfer function. 

Obviously, what we should do to assess the disturbance-rejection 
(sensitivity) properties of the loop is to examine the principal gains of S (see 
Figure 3.2). If the region lying between a(S) and a(S) is very narrow, then we 
are almost back with the SISO single gain, and we can describe the loop’s 
sensitivity properties very accurately. 

Usually the region will not be narrow — in this case, since we are always 
interested in keeping sensitivity as small as possible, it is the upper boundary 
of the region (that is, ¢) which will be important. 

To assess the propagation of measurement noise we look at the 
principal gains of T (see Figure 3.2). Again, the upper boundary is important, 
since we wish to minimize the propagation of measurement noise. Note that 


d(s) 


m(s) 


Figure 3.1 Standard feedback configuration. 
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Figure 3.2 Smallest and largest principal gains of multivariable sensitivity function 
S (solid curves) and complementary sensitivity function T (dashed curves). 


there are now two loop bandwidths: w,,, the bandwidth at the output, and 
@,,, the bandwidth at the input. 

As with SISO feedback, there is a conflict between keeping a(S) small 
and keeping (T) small, because of: 


Theorem 3.1: 

|1-a(8)|<a(T)<1 4 a(S) (3.24) 
and 

|1-—6(T)|<a(S)<1+6(T) (3.25) 


Proof: is a matrix norm, which satisfies the triangle inequality 
({A+B|| <|[A]]+|| Bll). Therefore 


o(T)=a(1—S)=a(I+(—S)) 
<1+a(S) (3.26) 
Now, 
|| Sx || 


Fae Sa 


Let w be a vector for which this supremum is achieved — that is, 


Sw _ 
Iwi 


a(S) 
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Then 
|| Zw] =| —S)w|| = ||w—Sw| 
Now, 
||w—Sw||>|}wl|—||Sw|| (from triangle inequality) 
and 
||w—Sw]||>||Sw||—||w|| (from triangle inequality) 
Hence 
|| Zw || >| lwil Sw] | 
So 
Sw 
Te? | Tal 
=|1—4(S)| 
Therefore 


o(T)2>|1—a(S)| (3.27) 


Relations (3.26) and (3.27) together prove (3.24). 

Clearly the results for ¢(S) in terms of (7) can be proved in the same 

way. ai 

If we have only one degree of design freedom (i.e if P(s)=J), then the 
reference-signal transmission properties are again determined by the princi- 


pal gains of 7(s). If we define the transmission bandwidth œ, to be that 


frequency at which a( T(ja,,))= o(7(0))/,/2, then it is clear that even when 
there is just one degree of freedom, 


Dry < Oby (3.28) 
except in very special circumstances (see Figure 3.2). In the frequency range 
Wry LOS Wry (3.29) 


sensor noise is being propagated without reference signals being followed 
particularly well, so one of the design aims could be to 


minimize (@,,—@,,) (3.30) 
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(Of course, the exact definitions of these frequencies are rather arbitrary, and 
may be changed to suit particular problems, but the principle will always 
hold.) Note that this design aim appears only for multivariable systems, since 
yy =@,y for SISO loops (and for one degree of freedom). 

If P(s)#I, so that we have a two degrees of freedom design, then 
the transmission of reference signals is determined by T(s)P (s), and therefore 
the relevant principal gain is o( TP ). Now œ, is defined by a( TP ) rather than 
o(T), and we can therefore have a more ambitious design aim: 


maximize (@,,—@py) (3.31) 


the point being that w,, > ,, is now possible. 
Another fundamental relation for the feedback loop is 


u(s) =F; *(s)K(s)P(s)r(s)—F; *(s)K(s)[m(s)+4(s)] (3.32) 
where 
F,(s)=I+ K(s)G(s) (3.33) 


If we are concerned to keep the control signals small, then clearly we 
must keep (Fp 1K) small. Since 


6(F;*K)<6(F;*)a(K) 
(K) 


a(F;) 


then to ensure small (F7 1K), we need to make o(F;)>4a(K). But 


o(F,)=o(1+ KG)<a(I+KG) 
<1+6(KG) 
<1+6(K)a(G) (3.34) 


So if o(F;)>6(K), then 1+¢(K)a(G)>6(K), which implies that 


oe 
AK) a(G)>1 (3.35) 


This is only possible if ¢(G)> 1 or @(K)« 1. So, at frequencies at which 
6(G) is not much larger than 1, control signals can be minimized only by 
keeping (K) as small as possible. 

If we use the largest or smallest principal gains to assess performance, 
we are in effect assessing the worst-case performance, in the sense that we are 
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examining the effects of least favourable signal directions. But in many 
multivariable systems we know in advance that certain signal directions can 
never occur, or are so unlikely that they need not be considered. In such cases 
the assessment may be unnecessarily pessimistic if the least favourable signal 
directions are among those which cannot occur in practice. 

Consider, for example, simultaneous control of the yaw and roll angles 
of an aircraft. If we take the positive directions of yaw and roll to correspond 
to yawing of the nose to starboard, and a clockwise roll when looking 
towards the nose, then output disturbances — namely disturbances of yaw and 
roll—of the same sign are relatively easy to correct: the control-surface 
movements required to correct for each of these disturbances are consistent 
with each other. Disturbances having opposite signs, however, are much 
harder to correct: correction of yaw requires one set of rudder and aileron 
deflections, while correction of roll requires approximately the opposite set of 
control-surface deflections. The largest principal gain of the sensitivity func- 
tion S will therefore correspond to disturbances of this kind. If we knew, 
however, that such disturbances could never occur, then it would be clearly 
inappropriate, or at least unnecessarily conservative, to use this principal 
gain as an indicator of performance. 

We see, therefore, the need to modify the use of principal gains if the 
possible signal directions are restricted. If the signals are restricted to a 
subspace of the signal space then the modification is rather obvious. Suppose, 
for example, that the disturbance is confined to a p-dimensional subspace. 
Then we can write it as 


d=Q6 (3.36) 


where 6 is a p-dimensional vector, and Q a matrix whose columns span the 
required subspace. If the columns of Q are chosen to be orthonormal,” so that 
Q"Q=I, then ||d||=||6||. Now let r=0 and m=0 in (3.20), so that 


y=SQ06 (3.37) 


We obtain an accurate measure of performance in the face of realistic 
disturbances by examining ¢(SQ), the largest principal gain of the modified 
sensitivity SQ. Once we use such modified sensitivities, however, results such 
as Theorem 3.1 cease to apply. 

Returning to the example of control of an aircraft, suppose that a 
typical disturbance results in a change in yaw of « units and a change in roll of 
B units. This can be represented by taking 


jan zH 3.38 
— V@? +B) B Gas 
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and 6 is just a scalar in this case. Here Q is written as a constant, but it can just 
as well be a transfer function. In the present example we may prefer to take 


1 a 
00- l (3.39) 


for instance. 
Perhaps a more realistic representation of possible disturbances is to 
confine the disturbance signal to a region which is not a subspace, such as 


d=ad,+fd,, aß>0 (3.40) 


(where « and ĝ are real scalars). In this case d is confined to a convex cone, 
and it is appropriate to find the largest value of || Sd ]||/||d || in this cone. This 
value will occur either when the signal direction coincides with one of the 
system’s input principal directions —if such a direction lies within the cone, or 
somewhere on the boundary of the cone. 

A third possible representation of realistic disturbances is obtained by 
using equation (3.36), but with Q a square matrix, and 6 of the same 
dimension as d, so that the disturbance is not confined to a subspace. Now, 
however, the columns of Q should have unequal norms, so that some 
directions are emphasized more than others. The choice of weightings is 
usually rather subjective, unless some specific information, such as the 
probability distribution of possible disturbances, is available. For example, if 
it is known that the disturbances have a multivariable Gaussian distribution 
with covariance V, then Q can be chosen as a square root of V: 


Q=V3 (3.41) 


In such a case, however, when a description of the disturbances as a 
stochastic process is available, one can obtain a statistical measure of 
performance by examining all the principal gains of SQ rather than just the 
greatest one, as described in Section 3.7. 

As an example of this method of representing disturbances, and 
referring back to the example of yaw and roll control, suppose that disturb- 
ances in the direction d, =[3, 1]’ are considered to be most typical, while 
those in the orthogonal direction d,=[1, —3]™ are least typical, and no 
direction is to be totally excluded. Then we take 


o- i a> (3.42) 
ao —3 


probably making a rather arbitrary choice for «, such as «= 10. This choice of 
Q would also be appropriate if we knew that the disturbance was a Gaussian 
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random variable with covariance 


v=o" (3.43) 
9a? +1 307-3 
eae oe Ce 


3.4 Relations between closed-loop and open-loop 
principal gains 


For SISO feedback, closed-loop performance specifications can be easily 
converted into requirements on the open-loop gain (that is, the modulus of 
thé return ratio). We now seek to express closed-loop requirements in terms 
of open-loop principal gains, for multivariable systems. 

First, we summarize our closed-loop requirements: 
(1) Sensitivity: keep o[(1+GK)~'] as small as possible. 


(2) Noise propagation: keep [I —(I +GK)~!] as small as possible. (This 
conflicts with 1.) 


(3) Tracking of reference signal (for one degree of freedom): keep 
o[1—(I+GK)~*]~1 and 6[I—(1+ GK)~!]~1. (This conflicts with 
2, but not with 1.) 


(4) Minimization of control energy: keep (K) as small as possible. (We 
expect this to conflict with 1 and 3.) 


We shall need the following lemma: 
Lemma 3.1: 

max (0, 4(Q)—1)<6(Q+1)<6(Q)+1 (3.45) 
Proof: The right-hand-side (RHS) inequality follows immediately 
from the triangle inequality since &(.) is a norm. To prove the left- 
hand-side (LHS) inequality, consider the identity 

Q=I+Q-I 
Therefore 

a(Q)<G(I1+Q)+1 (triangle inequality) 
But 

o(Q+1)>0 
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SO 


6(Q+1)>max (0, E(Q)— 1) E 


We shall also need: 


Lemma 3.2: 


max (0, o(Q)—!)<a(Q+I)<a(Q)+1 (3.46) 


Proof: LHS inequality: Let w be a vector such that 


MO+Dwl _ oan 
[w] 
Then 
|(Q+1)w| =||Ow+ wl] > |Owl|—| wl] (triangle inequality) 
Therefore 
Owl —lIwl]| <o(O+1) w| 
Hence 


|| Qw | 


—1<a(Q+I) 
lw || 


g(Q)—-1< 


and since (9 +1)>0, the LHS inequality is obtained. 


Qu | 
lol 


RHS inequality: Let v be a vector such that =g (Q). Then 


IG+Qwi_10v+vl 
s+) < — Toy el 


_l2| 


= 
lvl 


+1 (triangle inequality) 


=0(0)+1 m 


We can now restate our requirements in terms of open-loop principal 
gains: 
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(1) Sensitivity 
1 


IN 


1 
— .2, if o(GK)>1 
o(GR)=1 (by Lemma 3.2, if o(GK)>1) 


2 


1 ; 
ZGK) (if o(GK)>1) 


that is, low sensitivity is ensured by making ¢(GK) large. Note how similar 
the required calculation is to that for SISO feedback: for example, 


required sensitivity <0.01 = ¢(GK)>100 (approximately) 


(2) Noise propagation 
o[I—(1+GK)~*]=a@[(1+(GK)~1)~*] (matrix inversion lemma)? 
(3.47) 
<<. 
a(l +(GKY +) 


So we want a(I+(GKY +) as large as possible and hence, by Lemma 3.2, we 
want o(GK)~* as large as possible or (GK) as small as possible. 

Again, note that if ¢(GK)«1 then é[J—-(I+GK)~!]~6(GK) (by 
Lemma 3.2), so that conversion from closed-loop to open-loop specifications 
is very easy. 


(3) Tracking of reference signal 


(For one degree of freedom.) We require that 
I—~(I+GK)"'xI or (1+GK)"!x0 
More precisely, we require that 


IQ +GK)*x| 
il x {| 


«1 for any x 


so 


G(I+GK)"'«1 


90 PERFORMANCE AND ROBUSTNESS OF MULTIVARIABLE FEEDBACK SYSTEMS 


Gain Lower bound for ¢(GK) 


/ 


Upper bound for ¢(GK) 


Loop bandwidth 


Figure 3.3 Typical specification for a multivariable return ratio. 


or 

a(I+GK)>1 
or 

a(GK)>1 (by Lemma 3.2) 
Conflicts 


Summary of open-loop requirements: 


(1) o(GK) large 
(2) (GK) small 
(3) o(GK) large 
(4) @(K) small 


Since 6(GK)2o(GK) there are obviously conflicts between (1 and 3) and 2. 
Furthermore, since (GK) <6(G)@(K), there are also conflicts between (1 and 
3) and 4. 

We can see from this that the conflicts that arise are exactly the same 
as those that arise in the design of SISO systems, and we try to resolve them 
in the same way — by making o (GK) large at low frequencies, and ¢(GK) small 
at high frequencies. 

A typical specification for the principal gains of the return ratio GK is 
shown in Figure 3.3. 
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(4) Control energy 


We know that the control energy required is determined by the matrix 
F;'K=(I1+KG)"'K 


To assess the ‘gain’ of this matrix from open-loop characteristics we can use 
the relation 


o(F; 'K)>o(F; ')o(K) 


_9@(K)___a(K) 
 6(F;)) 6(1+KG) 


_ G(K) 
~6(KG) 


(if ¢(KG)>1) 


So at low frequencies we can use the open-loop characteristics, but note that 
for this we need the principal gains of KG, not those of GK. In some 
circumstances these can be very different. 


3.5 Principal gains and characteristic loci 


In Chapter 4 we shall present a design method which is based on mani- 
pulating characteristic loci. In this section we show how the characteristic loci 
can be related to the principal gains. Without such a relation we would not 
know how to manipulate the characteristic loci, which in some circumstances 
can be very misleading indicators of performance and robustness. 


Definition 3.1: Let Q(s) be a square matrix. Then Q(s) is normal if 
Q"(s)Q(s) = Q(s)Q¥(s) 

Lemma 3.3: Suppose that Q(s) is normal and 
O(s)= W(s)A(s)W~ *(s) 


where A(s)=diag {/,(s)} and each 4; is an eigenvalue function of Q(s). 
Then 


W” (s)= W7 (s) 


(that is, the eigenvectors of a normal matrix form an orthonormal set). 
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Theorem 3.2: Let Q(s) be normal, with eigenvalues {/,(s)} and prin- 
cipal gains {o;(s)}. Then we can order the A; so that 


o,(s)=|A,()|, i=1,2,...,m (3.48) 
Proof: Let A(s)=diag {4;(s)} and X(s)=diag {o;,(s)}. Then 
Q(s)= W(s)A(s)W~ *(s) 


= W(s)A(s)W*(s) (by Lemma 3.3) 


= Y A(swisywh(s) 


i=1 


Now let 


Al) =p)", pieR, piz0 
and 


ys) = wis) ® 


Then 
Q= È, pyw 


= ¥(s) diag {p,(s)} W™(9) 


where 
Y(s)= W(s) diag {ei 
Hence 


Y(s) Y(s)= diag {e~ 5%} W#(s) W(s) diag {e!9} =I 
(by Lemma 3.3) 


and similarly 
Y(s) ¥#(s)=1 


Hence Q(s)= Y(s) diag { p;(s)} W4(s) is a singular-value decomposition 
of Q(s). But the singular values of Q(s) are unique, so 


diag {p,(s)}=2(s) (up to ordering) 
that is, 


pi(s)=4;(s) = 
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This theorem shows that we can assess closed-loop performance from 
the moduli of the characteristic loci, or characteristic gains, if the return ratio 
is normal. 

This result is of more importance to design than to analysis. We can 
assume that the designer can easily obtain displays of the principal gains of 
any proposed design. One point of the theorem is that if he wants to 
achieve particular principal gains, one way of doing it is to design K such that 
GK is normal and has particular characteristic gains. 

Another reason for normal return ratios being desirable is that the 
sensitivity of the eigenvalues of a matrix to perturbations of its elements is 
minimized if the matrix is normal (Wilkinson, 1965). Knowing that one’s 
model of the plant is bound to be inaccurate, one hopes that the use of a 
controller which makes the return ratio normal also reduces the likelihood of 
the closed-loop being unstable when the compensator is used with the real 
plant, since stability is determined by the characteristic loci. This argument 
should be applied carefully, because it is possible for relatively small changes 
in the plant to produce relatively large changes in the return ratio, and 
consequently large departures from normality. However, this seems to occur 
only for plant with a high condition number, and it can be argued that such 
plants are in any case inherently difficult to control. 

Even when the return ratio is not exactly normal, the characteristic 
gains approximately equal the principal gains as long as its eigenvectors are 
nearly orthogonal. When this is not the case, the return ratio is said to be 
skew. 

The following measure of the skewness of a square matrix G has been 
proposed by Hung and MacFarlane (1982). First, a so-called Schur decompo- 
sition of G is obtained: 


G=S(D+T)s# (3.49) 


where D is diagonal, T is strictly triangular and S is unitary (see Section 8.2 
for details of how to compute this). Then a suitable measure of skewness is 


ms (G)= = R (3.50) 


where ||. {| denotes the Frobenius norm 
IX le=s/ {tr(X*#X)} 

and tr(.) denotes the trace of a matrix. This measure has the property 
0<ms(G)<1i (3.51) 


with 0 denoting complete normality and 1 denoting complete skewness. 
Hung and MacFarlane (1982) show that ms(G) can be related to the 
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sensitivity of the characteristic loci of G, while Pang and MacFarlane (1986) 
show that it can be related to the divergence between the principal and 
characteristic gains as follows (with the same notation as in the proof of 
Theorem 3.2): 


mon ffi a( Se co 


When the skewness of a matrix is appreciable, we can assert only that 
G<|A;|<o (i=1, 2,..., m) 


namely that each characteristic gain has a value lying between the smallest 
and largest principal gains. (This follows from (3.13) by letting u be an 
eigenvector.) 


3.6 Limitations on performance 


We have seen that the performance of SISO feedback loops is limited by the 
Bode gain—phase relationships: an ideal gain characteristic would be as 
shown in Figure 3.4. Such a gain characteristic is not realizable, and any close 
approximation would be accompanied by an excessive phase lag at the cross- 
over frequency, and so would inevitably lead to an unstable loop. 

We have also seen that, if the plant to be controlled is non-minimum- 
phase (that is, if it has one or more zeros in the right half-plane), then the 
achievable gain—bandwidth product is limited and, in practice, the achievable 
bandwidth is usually limited. 

To what extent do these limitations apply to multivariable systems? 


3.6.1 Gain—phase relationships 


Suppose that we have a performance specification which translates into open- 
loop principal-gain specifications as shown in Figure 3.3. For w<a,, 


Loop gain 


Oy 


Figure 3.4 Ideal, but unrealizable, open-loop gain characteristic. 
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a(GK)>L, and usually L>1. Now, 
a(GK)<|4,(GK)| 


where {A,(GK)} is the set of eigenvalues of GK. So, 
Idet (GK)|= Į | |A(GK)|>L” (3.53) 
i=i 


Similarly, for œ > %2, 
|det (GK)|<e”™ (3.54) 


Now, assuming that det(G(s)K(s)) is an analytic function, we can 
apply Bode’s gain—phase relationship (1.36) to it: suppose that det(GK) is a 
minimum-phase function, and that det (G(0)K(0))>0. Then, if log |det (GK)| 
decreases linearly with log w (w, <w<a,j), 


nm mlog(L/e) 
2 log(@2/@,) 


arg det(GK)< (approximately) 7 (3.55) 


over most of this frequency range. If det(GK) is non-minimum-phase then 
the phase of det (GK) will be more negative than this (by the usual argument). 
Now, 


arg det(GK)= } arg 4;(GK) 
i=1 


so 


x mlog(L/e) 
(GK)< 
g MEO E YS 9 log (sla) 


M5 


(3.56) 


i 


Closed-loop stability is determined by the number of encirclements of 
the point — 1 +j0 by the 4;. Suppose that GK is open-loop stable, so that for 
stability we require no encirclements. To see how large L/e can be while 
maintaining stability, let us make the most favourable assumption about the 
phases of the characteristic loci—that the greatest phase lag of any character- 
istic locus should be as small as possible (for each w). Obviously this happens 
if all the loci have the same phase- that is, if 


arg A,(GK)< -> log (L/e) 


2 tog (w,/a,) ory 


for each i. 
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If we try to reduce the lag of any one locus, then the lag of some other 
characteristic locus is bound to increase, so that the total phase lag of 
det (GK) is maintained. 

The relation (3.57) between the most favourable values of {arg 4;}, L/e 
and w,/q, is the same as the relation between the phase and gain of SISO 
return ratios. We can therefore deduce the same performance limitations as 
for SISO loops: if 


20 log, 9(Z/s) 


> 40 dB/decade 
log 1o(@2/@,) / 


and log, 9(w/@,) is quite small (<2, say), then closed-loop stability will 
be difficult to achieve, and may be impossible. Whatever the value of 
log,9(@2/@,), unconditional stability will not be possible. 

As for SISO loops, things are worse for open-loop unstable and/or 
non-minimum-phase systems. 


3.6.2 Right half-plane zeros and poles 


The question arises of whether individual loci obey the Bode gain-phase 
relationships. This has been investigated by Smith (1982), who found that if 
the eigenvalue function of the return ratio has no branch points in the right 
half-plane, then the individual loci do obey the gain-phase relationships. (A 
necessary, but not sufficient, condition for s=sp to be a branch point is that 
A(G(S9)K (So)) = 4,(G(So) K (So) for some pair (i, j ).) What happens if there are 
right half-plane branch points is still not understood, but violations of the 
gain-phase relationships are not observed in practice. (And we know, 
of course, from Section 3.6.1 that it is impossible for all the phases of the 
characteristic loci simultaneously to be more positive than they would be if 
they obeyed the gain—phase relationships.) 

Let us assume that the individual loci obey the gain—phase relation- 
ships, and that GK has a zero in the right half-plane (and that det (GK) has 
too). Then at least one of the eigenvalue functions has a zero in the right 
half-plane, since det(GK)=1;/,, and will therefore be subject to the 
gain-bandwidth limitation which is imposed on SISO loops by right half- 
plane zeros. 

But ¢(GK)<|4,(GK)|, so the same limitation is imposed on the open- 
loop gain—-bandwidth product of the multivariable loop, if this is measured by 
the smallest principal gain ¢(GK). 

If GK has a pole in the right half-plane then it is probably true that at 
least one characteristic gain must exceed unity up to some minimum fre- 
quency, which is determined by the pole location in the same way as for SISO 
loops. If this is true then (GK) must also exceed unity below this frequency, 
and hence the loop bandwidth cannot be much lower than this frequency (by 
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Lemma 3.1). This has not been proved, however, and the argument cannot 
parallel the SISO argument completely because the eigenvalue functions of 
GK are almost always irrational. 


3.7 Transmission of stochastic signals 


In Section 3.3 we derived qualitative closed-loop requirements for principal 
gains. In Section 3.4 we found how to convert both qualitative and quantitat- 
ive closed-loop specifications into open-loop specifications. We now state 
some results which can be used to determine closed-loop specifications 
quantitatively, if there are stochastic signals in the loop. 

We shall assume that all signals are zero-mean, stationary stochastic 
processes. We shall use {x(t)} to denote such a process, and x(t) to denote 
a single sample function from such a process. E{x} will denote the mean 
value of the process (which will always be zero, by assumption), E {xx} 
will denote its covariance, and so on; ®,,.(«) will denote its power spectral 
density, namely the Fourier transform of its autocovariance function 
Pxx(t) = E{x(t)x"(t+7)}. 

The performance of feedback systems which contain stochastic signals 
is often best measured in terms of the variances of the various signals. 
Typically, the variance of the error between the reference input and the 
output, and the variance of the plant input signal, are of interest. Since we are 
dealing with vectors of signals, it is mathematically convenient to assess 
performance in terms of the sum of the variances of the component signals; 
so if {x(t)}} is a vector-valued process, a measure of its ‘signal 
strength’ (more accurately, its power) is tr(E{xx™}). Using the fact that 
tr(AB)=tr(BA), and the properties of the expectation operator E{.}, we 
obtain the result that 


tr(E{xx"})=E{x™x} (3.58) 


For SISO systems, if y(s)=G(s)u(s) then the power spectral densities 
of y and u are related by (Papoulis, 1984; Priestley, 1981) 


®,,(@) =|G(jo)|? ®,,,(@) (3.59) 
Also, we have 
2 1 = 
E{y*}= zx |. ®,,(@) da (3.60) 


Thus, by knowing ©,,(@) and |G(jw)| we can compute E{y7}, and we can 
use graphical displays to identify any frequency band at which most of the 
contributions to E {y?} are concentrated. 
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These relations can be generalized so as to apply to multivariable 
systems, but there are two complications. The first is that there is not just a 
single gain — we expect to use instead a band of gains, bounded by the smallest 
and largest principal gains. The second complication is that if y(t) is a vector, 
then a suitable index of performance will be, in general, E { y" Wy} rather than 
E{y'y}, where W is a weighting matrix which is real, symmetric (usually 
diagonal) and positive-semidefinite. 

It can be shown that 


E{y™ Wy} =5- i tr [W®,,(@)]do (3.61) 


and, if y(s)=G(s)u(s) and G(s) is stable, 

D, (0) = G(jo)®,, (0) G"(—joo) (3.62) 
The weighting matrix need cause no trouble, because 

tr[Wo,,] = tr[W'?0,, Ww] (3.63) 


=tr [W1 G(jo)®,,(@)G"(—jo)W*7] 
so 
E{y"Wy}=E{z"2} (3.64) 
where 
z(s)= W*™? G(s)u(s) (3.65) 


Thus we can confine ourselves to the case W=I. 
The following results can be shown to hold (Postlethwaite et al., 1981). 


Theorem 3.3: If y(s)=G(s)u(s), G(s) is stable, and the power spectral 
density of {u(t)} is ®,,,(@), then 


E{yTy} => is La? (Oi (oC jo)) de (3.66) 


This result is in a useful form for calculating total variances obtained with a 
particular design. But for guidance in design we would like a result in terms of 
the principal gains of G(s) alone. Such a result is: 


Theorem 3.4: If y(s)=G(s)u(s) and G(s) is stable, then 


(Oy) — 52(G(ja)) (3.67) 


(CGO) < FEO) 
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3.8 The operator norms ||G||, and ||G|,, 


So far we have considered the ‘gain’ of a transfer function only at individual 
frequencies. But it is possible, and useful, to have a cruder measure of the 
‘gain’ —a single number associated with the transfer function. We shall define 
two such measures, or norms, on G, and denote them by ||G||, and ||G||,, 
respectively. 


Definition 3.2: Let G(s) be a proper transfer function with no poles 
on the imaginary axis. Then 


IGla= | te | stooo} (3.68) 


and 


|G ||. =sup 6(G(ja)) (3.69) 


It can be shown that || G||, and || G || both satisfy the usual properties 
of norms, namely: 


||G||>0, with ||G]|=0 if and only if G=0 

loG||=|a| |G], for any scalar « 

|G+H||<|Gl|+] 4 
|G ||, also satisfies 

IGH |o SIG Ilo lH. 
but this inequality is not satisfied by ||G||,. These norms are often 
referred to as operator norms, as the system represented by a transfer 
function is an operator which maps functions—input signals—into 
other functions — output signals, and these norms measure the amplifi- 
cation (or at least the greatest possible amplification) of this mapping. 
We can be more specific about this: from equations (3.61) and (3.62) we 
see that, if ®,,(@)=TJ, y(s)=G(s)u(s) and G(s) is stable, then 

E{y'y}=||Gll2 (3.70) 


so that ||G||, gives us precise information about the power gain of G 
when the input is a white stochastic process. 
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0 
Figure 3.5 Reading off || G||,, from plot of largest principal gain. 


To see the meaning of ||G||.,, suppose that the input signal u(t) has 
finite energy, as measured by 


jéj | l | wuar} (3.71) 


(which is again a norm, this time on signals rather than systems), but that we 
have no other information about it. It is meaningful to ask what the greatest 
increase in energy is that can occur between the input and output for a given 
system. This is answered by: 


Theorem 3.5 (Vidyasagar, 1985; Francis, 1987): If lul <œ, 
y(s)= G(s)u(s) and G(s) is stable and proper and has no poles on the 
imaginary axis, then 


sup Wyle Glia (3.72) 


u llull 


Proof: omitted. 


Clearly both || G||, and ||G || can be related to the principal gains of 
G. For || G||, the relation is through the expression which appears in Theorem 
3.3, whereas for ||G || the relation is very simple. Figure 3.5 shows a plot of 
G(G(ja)) for some transfer function G, and indicates how ||G||,, can be read 
off from the plot, since it is simply the peak value of a(G). (An alternative way 
of finding ||G||,, from a state-space realization, which is more efficient if 
||G||,, is the only quantity being computed, is given by Boyd et al. (1988).) 

In applications these operator norms are applied to closed-loop 
transfer functions such as the sensitivity function S(s) and its complement 
T(s). They are generally used in connection with optimization problems; for 
example, the linear quadratic Gaussian (LQG) theory used in Chapter 5 is 
concerned with minimizing || Q ||, for a suitably specified Q, while the problem 
of obtaining the best possible sensitivity function is concerned with mini- 
mizing | S||,,. When performing such optimizations it is of course essential 
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that the transfer functions whose norms are being optimized remain stable, 
and this requirement gives rise to the following notation (which we shall not 
need here, but is becoming widespread in the literature). The set of all proper 
transfer functions G for which ||G||,,<0o is denoted by L,, (because it is a 
Lebesgue space), while the set of all transfer functions which are members of 
L, but which are also exponentially stable, is denoted by H „ (because it is a 
Hardy space). For a discussion of these spaces, and a more rigorous treatment 
of operator norms, see Vidyasagar (1985) or Francis (1987). 

We point out that the ‘peak M value’ M, which appears in classical 
control theory is, in the notation introduced here, the same as || 7'||,,, where T 
is the complementary sensitivity I—S. 


3.9 The use of operator norms to specify performance 


The kind of specifications we formulated earlier can be recast into bounds on 


|. |, by using suitable weighting functions. Suppose, for example, that we 
have the specification 


a(S(j@))<0.2, wx<l (3.73) 
and 
6(T(jo))<0.1, w 250 (3.74) 


We can refine this specification to that shown in Figure 3.6, in which the gain 
characteristics have been chosen to be those of simple transfer functions. The 


upper bound for the sensitivity can be approximated by the gain of the 
transfer function 


w,(s)=0.141(1 +5) (3.75) 


3 


Upper bound for g(S) 


Upper bound for ¢(T) 


Figure 3.6 Specifications which satisfy equations (3.73) and (3.74). 
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(in which the steady-state gain is 0.141, rather than 0.2, to allow for the 
departure from the straight-line approximation at w=1), and the upper 
bound for the closed-loop transfer function can be approximated by the gain 
of 


3.536(1 +0.02 
iios A - 2 (3.76) 
The original specification is then included in the new specification: 
Iwi Slas! (3.77) 
and 
lwz Tle<1 (3.78) 


The choice of w, and w, is somewhat arbitrary, of course, and many other 
choices would also include the original specifications. 

The purpose of stating specifications in this form is that a compen- 
sator can then be obtained automatically, by using the H,, optimization 
theory to be presented in Chapter 6. But problems arise which are similar to 
the problems discussed in Chapter 1 in connection with the idea of choosing 
S or T, and then deducing the required compensator. Here again we have the 
problem that non-essential details of the specification are being determined 
arbitrarily, and not by the plant’s characteristics. There is also the problem of 
checking whether the specification is consistent. If we recall that 


S+T=I (3.79) 


and that both S and T are determined by a return ratio which is constrained 
by the requirements of the generalized Nyquist stability theorem, then we 
realize that (3.77) and (3.78) may be inconsistent with each other. 

The difficulties of using ||. ||,, to formulate performance requirements 
have been explored in depth by Freudenberg and Looze (1986). 


3.10 Representations of uncertainty 


3.10.1 Unstructured uncertainty 


Classical feedback-system design dealt with the problem of plant uncertainty 
by prescribing stability margins, by means of specified gain or phase margins 
or specified peak M values. Implicit in the use of such margins was a rather 
crude model of the uncertainty to which the plant was subject. If the only real 
concern was to maintain stability, and the prescribed phase margin was 40°, 
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for example, then there was an implication that the plant model may be 
underestimating phase lag by as much as 40° at the cross-over frequency. 
Such a description of uncertainty is unstructured in the sense that it bounds 
the magnitude of possible perturbations, but does not trace the origins of the 
perturbations to specific elements of the plant. 

In this section we are concerned with making explicit models of 
unstructured uncertainty. For multivariable plant the three most commonly 
used models are as follows: let Go(s) be a nominal transfer function, which is a 
best estimate, in some sense, of the true plant behaviour, and let G(s) denote 
the true transfer function of the plant; then 


G(s) =Go(s)+A,(s) (3.80) 

G(s) = Go(s)[1 + A,(s)] (3.81) 
or 

G(s)=[1+A,(s)] Go(s) (3.82) 


where A, represents an additive perturbation, A, an input multiplicative 
perturbation, and A, an output multiplicative perturbation. 

The only restriction on the perturbations is on their ‘size’, which is 
measured by || A||,,. If we want to make the size frequency-dependent then we 
can use 6(A), or, equivalently, we can set A= W, À W,, where W, and W, are 
minimum-phase transfer functions (no poles or zeros in the right half-plane) 
which serve as frequency-dependent weighting functions. In this case we can 
always take ||A| wo <l. 

The additive model (3.80) may be used to pose some robust stabiliz- 
ation problems which have nice solutions (see Glover, 1986), but the multi- 
plicative models (3.81) and (3.82) are often more realistic, since ||A,||,, and 
| A, ||. represent relative rather than absolute magnitudes. For example, 
| A; ||. <0.1 implies that the size of the perturbation is at most 10% of the 
‘size’ of Go, since 


lG- Gollo = || GA; ll co 
<|| Golla lA; lo 
<0.1 || Golla (3.83) 
However, specifying || A, ||, <0.1 implies that 
| G-- Go lo = | Ag lle 
<0.1 (3.84) 


We need both models (3.81) and (3.82) because multiplication of 
transfer-function matrices is non-commutative. In effect, (3.81) pretends that 
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all the uncertainty occurs at the plant input, while (3.82) pretends that it all 
occurs at the output. Similarly, two weighting matrices (W, and W,) rather 
than one ensure that the disturbance bound can always be normalized to 


IWT AW leS! (3.85) 


EXAMPLE 3.1 (Skogestad and Morari, 1987) 


A process plant operates with certain flow-rates in the region of 
100 kmol min` +. Typically, changes of up to 10 kmol min™+ are required. 
These changes are effected by servo-controlled valves which rely on 
measurement of the flow, but the flow measurement is in error by 1% (see 
Figure 3.7). So when a change of flow-rate from 100 to 110 kmol min“! is 
required, an actual change to 111 kmol min™ + may occur. The error in the 
required change is therefore about 10%. Thus our plant model, which 
describes changes about some operating point, is subject to errors of up to 
10% on each input channel (assuming that each input is a flow-rate, 
regulated as described). Since the error on each input channel is indepen- 
dent of the others, a suitable representation of ‘model uncertainty’ is by 
means of (3.81), with 


A,=diag {6,}, |6,|<0.1 (3.86) 
which gives 


| 10A;||.<1 (3.87) 


Process plant 


C) Flow measurement 


Figure 3.7 Example given by Skogestad and Morari (1987) of uncertainty about 
flow-rate. 


In some cases it will not be appropriate to refer all the uncertainty to 
the input or the output. It is then necessary to use either the additive model 
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(3.80), or some other multiplicative model such as 


G(s)=[1+A,(s)]Go(s)LI + A,(s)] (3.88) 
or 
G(s) = Go(s).*[I + As(s)] (3.89) 


where <. +’ represents the Schur product, namely element-by-element multipli- 
cation. Among other models which have been proposed are the inverse 
multiplicative model 


G(s)=[I + Ag(s)]” *Go(s) (3.90) 
and a model based on matrix fraction descriptions: 


G(s)= N(s)D~*(s) (3.91) 


where 
N(s)=N (s)+An(s) and D(s) = Do(s)+Ap(s) (3.92) 


Models (3.91) and (3.92) can also be extended to so-called fractional 
representations, where N(s) and D(s) are allowed to be stable transfer 
functions rather than polynomial matrices. Such fractional representations 
will be treated in some detail in Chapter 6. 

A major advantage of using operator norms to describe model un- 
certainty is that it is not necessary to postulate the existence of a ‘true’ 
transfer-function model. Real plants are always non-linear, and frequently 
time-varying, so no ‘true’ transfer function description can exist. But we can 
interpret a model of uncertainty, such as (3.80), in the following way. Suppose 
that we apply an input u(t) to a plant, with |{u||,=1, and that the resulting 
output is y(t). Suppose also that the same input applied to a model with 


transfer function Go gives the output yo(t). Then (3.80) really says little other 
than 


ly—Yoll2<lAallo (3.93) 


3.10.2 Structured uncertainty 


In the above Example 3.1 on flow control, we had information about the 
structure of the input uncertainty: we knew that the uncertainty for each valve 
was independent of the uncertainty for the others. If there are two such 
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valves, a correct description of the uncertainty is 


=| o i |5|<0.1 (3.94) 


But when we write || 10A;||,,<1 instead, we lose all the structural infor- 
mation, since this description also allows perturbations such as 


0 0.1 
al 5 | (3.95) 
and 
feat 0.1 0.1 abe 
2,01 0.1 629) 


which do not correspond to any real perturbations. The use of the un- 
structured description generally leads to compensator designs which are 
unnecessarily conservative, because they perform satisfactorily (in some 
sense) even in the face of perturbations which can never occur. The controller 
ends up having to be ‘detuned’ just to guard against non-existent events. 

In practice both structured and unstructured information may be 
available about plant uncertainty. Particular parameters in a state-space 
model may be known to vary over known ranges, for example -which is 
certainly highly structured information—while at the same time it may be 
known that all the details of the model become highly inaccurate above a 
certain frequency, because of unmodelled lags, hysteresis, resonance, parasitic 
couplings and so on -this is unstructured information, since it would not be 
practical or possible to describe all these in detail. The question is how to 
incorporate both kinds of information simultaneously into an accurate 
description of the plant’s uncertainty. 

In practice it is always possible to represent information about 
uncertainty very accurately, in the following way. We consider the plant to 
have three sets of inputs and three sets of outputs, as shown by the block P in 
Figure 3.8. The first set of inputs consists of all manipulable control variables, 
and the second set consists of all other external signals, such as disturbances 
and measurement noise. The first set of outputs contains all the measured 
signals which are available to a feedback compensator, while the second set 
consists of any other outputs whose behaviour may be of interest (these need 
not be real signals which can be measured at some point in the plant, but may 
include conceptual quantities such as error signals). Figure 3.8 shows a 
feedback compensator K connected between the first pair of outputs and 
inputs, with its own external signals which represent reference or demand 
signals. 

The third set of inputs and outputs of P is novel. Wherever there is 
uncertainty about some part of the plant, we can imagine representing it 
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K 
Compensator 


P 


L A x 


Figure 3.8 A standard representation of an inaccurately known plant under 
feedback control. P and K are assumed to be known accurately, and any un- 
certainties have been ‘pulled out’ into the system A. 


‘locally’ by using one of the descriptions we employed in Section 3.10.1 -for 
example, if a parameter 0 has nominal value @, but an uncertainty of +20%, 
then 8=6,(1 +6) (|| <0.2). We can therefore envisage a block diagram of the 
plant with several A blocks scattered about in it. Now, each of these blocks 
has an input (which may be a scalar or a vector) and an output. We collect all 
the inputs together, and these form the third set of outputs of P; similarly we 
collect all the outputs together to form the third set of inputs to P. In this way 
all the A blocks are taken outside the plant; in Figure 3.8 they are all 
contained inside the block labelled A. Viewed as a transfer-function matrix, 
this external A has a very special structure: it is block-diagonal, and the 
blocks on the diagonal are just the small As which have been ‘pulled out’ from 
inside the plant. We write 


A(s)=diag {A,(s),.. . ,A,(s)} (3.97) 


if n such blocks have been ‘pulled out’. Note that each A j may bea scalar ora 
matrix. 

We can make one further refinement to this model: we insert scalings 
in the plant so that all the uncertainties are normalized to 


lA;lo <! (3.98) 


This is always possible, and helps in formulating standard conditions for 
achieving robust designs, as we shall see in later sections. Note that we obtain 
the nominal plant model by setting A = 0. 

If the compensator K is already known, then it can be amalgamated 
with the ‘certain’ part of the plant, P, to form a single system Q, as shown in 
Figure 3.9. It is clear from this that any uncertainty associated with the 
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Compensated plant 


Uncertainties 


Figure 3.9 As Figure 3.8, but with the plant Pand compensator K combined into 
the single system Q. 


compensator can be modelled in the same way as plant uncertainty, and 
results in the addition of some blocks to A. 


EXAMPLE 3.2 


Let us return to the process plant with the flow control valves which 
we considered in Example 3.1. For concreteness, let us suppose that 
the plant has two inputs and three outputs, and that its nominal model, 
excluding the servo-valve dynamics, is Go(s). Also suppose that each 
servo-valve has a nominal transfer function M%(s), and each has a 10% 
multiplicative uncertainty, as already described, which is present at all 
frequencies. But now we shall also assume that the plant has unstructured 
multiplicative uncertainty A, at its input, as shown in Figure 3.10, which 
comes into effect at high frequencies. Suppose that A, is to have almost no 
effect below a frequency wo, but that thereafter its effect is to increase so 


15,|<0.1, |6,| <0.1 
o{As)<ir(s)| 


Valves Plant 


Figure 3.10 A combination of structured and unstructured uncertainty at the 
input of a two-input, three-output plant. 
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Figure 3.11 Gain characteristic of r(s) defined by equation (3.101). 


that G(Go(I+A)) is bounded by a constant gain, even though (Go) 
decreases to zero at high frequencies. If 


a(Go(jæ)) < < (3.99) 


for large enough (that is, the plant exhibits at least ‘first-order roll-off ’) 
then a suitable characterization of the unstructured uncertainty may be 


g(A;) < |r(s)| (3.100) 


r(s) = [1-»,(2)](1 +2) (3.101) 
0 0 


and b;(s) is the transfer function of a normalized third-order Butterworth 
filter. This gives a sharp increase from zero at frequencies just below wo, 
followed by a gradual (first-order) increase thereafter, as shown in 
Figure 3.11. The bounds on 6, and ô, are |ô | <0.1 and |ô | <0.1, as 
before. Figure 3.10 can be redrawn as Figure 3.12, which is now in the 
general form described above, with 


where 


JE 
0 8, 
Nahe L A e et (3.102) 
0 0: 
0 0 f A, 


lâs, |d,|<1 (3.103) 
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Plant 


\5, | < 1,.|52| < 1, |Asll.x <1 


Figure 3.12 The system of Figure 3.10 redrawn in the form of Figure 3.8. 


and 


a(A;) <1 (3.104) 


Note that scalings have been inserted inside the plant to normalize the 
uncertainty bounds, and that (3.104) can be rewritten as 


(3.105) 


As ilo <1 


From this example it should be clear that the representation of 
uncertainty shown in Figures 3.8 and 3.9 is both general and accurate, in 
that it allows a wide variety of structured uncertainty models to be described.. 
There is one respect in which the representation is inaccurate, however. The 
bounds 


Ajllao <1 
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allow A, to contain complex elements. In many cases, such as the valve-gain 
uncertainties 6, and 6, in Example 3.2, it is known that only real elements 


can occur. To capture this accurately, a more elaborate description of A; than 
(3.98) is required. 


3.10.3 Uncertainty templates 


A straightforward and accurate representation of structured uncertainty is 
obtained by computing the set of possible frequency responses of a plant at 
each frequency. This set can be displayed as a region on each element of a 
Nyquist array. Such regions are often referred to as templates, because if they 
are displayed on Nichols charts the design of a compensator at a given 
frequency can be viewed as a process of sliding the uncertainty template 
around the Nichols chart, its shape unchanged, until every point of it satisfies 
the specifications at that frequency (Horowitz, 19 82).° 

In practice, of course, each region is defined by a finite set of frequency 
responses, this set being generated by varying some model parameters over 
ranges which define the extent of the uncertainty. In theory there could 
be considerable difficulties in determining the boundaries of the regions 
in this way, since these boundaries do not in general correspond to the 
boundaries of the expected parameter variations. (Consider templates for 
[s* + (a—2)s + 4]~', for example, if 0 < a < 10.) But in practice it appears to 
be possible, with some effort, to determine the uncertainty templates, at least 
if only a few model parameters are allowed to vary. This is probably so 
because designers have other sources of information about particularly 
troublesome parameter values—for instance, values which cause a severe 
resonance to occur may be discovered by experience even before the math- 
ematical model of the plant is developed. 


3.11 Stability robustness 


Now that we have some means of describing the extent of our uncertainty 
about a system model, we turn our attention to ways of checking whether a 
particular feedback design is robust in the face of this uncertainty. The 
fundamental property which must be retained for all possible perturbations 
of the plant (and of the compensator) is stability of the feedback system. 


3.11.1 Unstructured uncertainty 
Consider the additive model of unstructured uncertainty defined by (3.80) 
G = Go + A, 


with 2 z 
A, > W, A, W, and l A, | 90 <s 1 
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Figure 3.13 Feedback loop containing plant with additive uncertainty. 


Suppose that we design a compensator K which stabilizes the nominal model 
Go. Then the loop with the actual plant is as shown in Figure 3.13, and this 
can be redrawn in the standard form, with A, appearing in a feedback path, as 
in Figure 3.14. Note that in both figures the dashed line encloses a stable 
system. 

If we write 


y v Qi Qi | | 
~{=Q} ./= x 3.106 
H of;] Fa Q22 JLZ ( ) 
where u, v, y and Z are as defined in Figures 3.13 and 3.14, then it is easy to 
show that 


Qi = Go K(I+GK)` t} (3.107) 

Q2 =(1+GK) ‘W, (3.108) 

Q21 = Wa (I + KGo) `K (3.109) 
and 

Qa = —W,(1+ KG) ‘KW, (3.110) 


To check whether the feedback system will remain stable under 
permissible perturbations, we need to check whether the feedback combina- 
tion of Q,, and A, will remain stable, for all allowable A,. Now, we know that 
Q,, is stable (since K stabilizes Go), and we assume that A is stable. Then the 
feedback system can become unstable only if one or more of the characteristic 
loci of — Q,,A, encircles the point — 1; we know this from the generalized 
Nyquist theorem (see Chapter 2). Now, if 4(Q,,A,) is any eigenvalue of 
Q32 A,, then 


|2(Q22 Aa)! < p(Q22A,) < G(Q22A,) (3.111) 
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Figure 3.14 The feedback loop of Figure 3.13 redrawn in the form of Figure 3.9. 
where p(.) denotes the spectral radius, and so no encirclement of —1 can 


occur if 


&(Q2,A,) <1 (3.112) 


at each frequency, or equivalently, if 


]O22As lla <1 (3.113) 
Now, 
Og As llo < ll Q22 beaches (3.114) 
and 
Kalla <1 
SO 
Qa2llo < 1 (3.115) 


is certainly a sufficient condition to guarantee that instability cannot occur 
for any of the possible perturbations. For a given compensator design, this 
condition can be checked easily by displaying the principal gains of Q,,, and 
seeing whether the greatest one ever exceeds unity. The quantity (|| Q33 llo) * 
is sometimes known as the gain margin excess (assuming that (3.115) is 
satisfied). It is often informative to display a plot of [¢(Q,,)]~' against 
frequency, in order to see the frequencies at which stability is most vulnerable. 

If all possible perturbations can occur for which || A, lle <1 holds, 
then the condition (3.115) is necessary as well as sufficient for robust stability. 
For suppose that 


|Q22Il,=1 +e (3.116) 
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for some € > 0, and let Q,, have the singular-value decomposition 

Q2(s) = ¥(s)Z(s) U"(s) (3.117) 
The perturbation 

A,(s) = —e7~*? U(s) Y¥(s) (3.118) 
may occur (where D is a real scalar), and is permissible, and leads to 

—Q(s)A,(s) = e7*” ¥(s)X(s) Y” (s) (3.119) 
which is a normal return ratio. Hence 


| Amax(Q22(j@o)A,(j@))| = F(Q22( ja) (3.120) 
=] +e 


for some wg. Now, as we can choose D to be as large as we like, so as to make 
arge J°°? as negative as we like, we can certainly choose D to be large 
enough to force at least one clockwise encirclement of — 1 by the character- 
istic loci of —Q,,A,. (From (3.119) we = that the characteristic loci are 
given by —e i®? ¢;(Q,,(ja)), since Y” = Y~ +.) In other words, we can find a 
destabilizing perturbation if (3.116) holds. 

A crucial assumption which we made at the beginning of the deriva- 
tion above is that A, is stable. This means that the plant’s unstable poles are 
assumed to remain fixed as it varies over its region of uncertainty. If we start 
instead with one of the multiplicative descriptions of uncertainty, such as 
(3.81) or (3.82), the whole of the preceding derivation still holds, so that we 
again end up with the condition (3.115), the only difference being that the 
transfer function Q is changed (see Exercise 3.8). But in each case we need to 
assume that the perturbation is stable, which amounts to assuming that 
unstable poles remain fixed. If this assumption is too unrealistic we can use 
(3.90), which allows the creation of new unstable poles, even if Ag is stable, or 
the even more exotic 


G = Go + Ap Go) t (3.121) 


which allows unstable poles to move in the plane even if Ap remains stable. 
Both (3.90) and (3.121) can be treated in exactly the same way as we treated 
the straightforward additive perturbation (3.80), again obtaining condition 
(3.115), but each time with Q,, a different transfer function. 

For some uncertainty models we can relax the assumption about the 
stability of the perturbation. Consider the output multiplicative model (3.82). 
We shall now allow A, to be.unstable, but impose the alternative assumption 
that the nominal plant Gy and the perturbed plant G have the same number of 
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unstable poles. The argument is now slightly different from the previous one. 
Since the loop is stable for A, = 0, then by assumption, and since G and Go 
always have the same number of unstable poles, the loop will remain stable 
provided the number of encirclements of — 1 by the characteristic loci of GK 
remains unchanged, which it will if no locus passes through — 1 as G varies; in 
other words 


det [I + G(j@)K(jw)] #0 (3.122) 
for any w and any permissible G. (Readers who remember the derivation of 
the generalized Nyquist theorem will note that we could have come directly 
to this statement.) Now, (3.122) is the same as 

clI + G(jo@)K(jw)]>0 (3.123) 
for all @, which is the same as 

g[1+G,K+A,G)K]>0 (3.124) 


or 


a{[(A,GoK)-*+A>1+I]A,G,K}>0 (3.125) 
which holds if 

g{[(GoK)"'+I]Az'+1}>0 (3.126) 
By Lemma 3.2, this will hold if 

a{[(GoK)'+1]A,*}>1 (3.127) 


which is implied by 


o(As')a[(GyK)"' +I] >1 (3.128) 
Hence 

G(A,)o{[(GoK)-' +1} 4} <1 (3.129) 
or 

&{GoK[I + GoK]“}} <a (3.130) 


Note that this condition is essentially the same as that obtained by 
using the previous approach (see Exercise 3.8), although the set of allowed 
perturbations is rather different. We have established the sufficiency of (3.130) 
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for robust stability, but it can again be shown to be a necessary condition, if 
all permissible perturbations may actually occur. 

Stability conditions such as (3.115) and (3.130) are special cases of a 
very general result, known as the small gain theorem, which states that a 
feedback loop composed of stable operators will certainly remain stable if the 
product of all the operator gains is smaller than unity. This is true for non- 
linear as well as linear operators. 

Postlethwaite et al. (1981) introduce a ‘small phase theorem’, which 
may be useful when the phases of perturbations, rather than their gains, can 
be bounded. 


3.11.2 Structured uncertainty 


We now consider structured uncertainty, as represented in Figure 3.9; A has a 
block-diagonal structure, each block A, satisfying || A; llo < 1. We shall also 
assume that each A, is stable. Partitioning the compensated plant Q as before, 
the condition found earlier, namely 


Q22 Ilo <1 (3.131) 


is certainly sufficient for robust stability. But now it is no longer a necessary 
condition, because most perturbations which satisfy || A || œ < 1 are no longer 
permissible: the only permissible ones are those that have the appropriate 
block-diagonal structure. Furthermore, (3.131) can, in some cases, be so 
conservative as to be useless. Therefore robustness tests are needed which are 
better indicators of necessary conditions than is (3.131). 

Let BD; denote the set of stable, block-diagonal perturbations 
with a particular structure, and with ||A,||,, <ô. To describe this set fully 
for each design problem we should really write something like 
BD,(m,,m,.. - ,M,,k,,k2,.--,k,), to denote that A contains m = }%_; m; 
blocks, each block being repeated m; times and having dimensions k; x k;. 
(Repeated blocks allow some correlated perturbations to be modelled.) But 
we shall usually employ the abbreviated notation, with the appropriate 
structure understood to be fixed.® 

A permissible perturbation destabilizes the system if and only if 


det [I — Q,,(jw)A(jw)] = 0 (3.132) 
for some w and some AeBD,. We therefore define 
0 ifdet[I— Q24] #0 for any AeBD,, 
u(Q22(j©) = eam (a (A(jo)): (3.133) 


co] 


det [I — Q,,(jw)A(jw)] =0}7* otherwise 
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We also define 


|| Q22 Il, = sup #(Q22(jo)) (3.134) 


which is suggestive but rather misleading, since ||. ||, is not a norm. The 
function u(Q) is called the structured singular value of Q, and depends on the 
structure of the set BD, as well as on Q. 

Although (3.133) provides a definition of y(.), it gives no help in 
computing it. Doyle (1982) has proved a number of properties of u(.), the 
most important of which, for us, are as follows: 


H(aQ) = |a| u(Q) . (3.135) 
u(1)= 1 | (3.136) 
u(4B)< 6(A)u(B), A,B both square (3.137) 
u(A)= a(A), AeBD, (3.138) 
If n= 1 and m, = 1 (see above) then u(Q) = 4(Q) (3.139) 
If n= 1 and k, = 1 (see above) then u(Q) = p(Q) (3.140) 


(namely the spectral radius, max |A,(Q)|) 


Let D be any real, diagonal, positive matrix with the structure 
diag {d; Ips. + + sdm, Iris dmi +1 trs- ++ o dmden} (3.141) 
and d; > 0, then 
u(DQD~*) = u(Q) (3.142) 


Let U be any unitary matrix (UU = I) with the same block-diagonal 
structure as the set BD,. Then 


max p(UQ) < „u (Q) < inf ¢(DQD~") (3.143) 
U D 


Most of these results can be proved quite easily (see Exercise 3.9). 
The importance of u(.) can be seen from the following central result: 


Theorem 3.6 (Stability robustness theorem— Doyle et al., 1982): The 
system shown in Figure 3.9 remains stable for all AeBD, if and only 
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if 
|Qo2ll, <1 (3.144) 


Proof: 


sup p(QA) < sup u (QA) (by (3.143)) 


< sup[u(Q)o(A)] (by (3.137) 


< |Oll, (since (A) < 1) 


So, if ||Q,.||,< 1, then p(QA) <1, and I—QA cannot be 
singular (on the imaginary axis). Hence the system remains stable. 
Conversely, if || Q22 ||, > 1 then u(Q22(j@o)) > 1 for some ap and so, 
by definition (3.133), there exists a perturbation Aj¢BD,, such that 
det [I — Q22(j@o)Ao ] = 0 and g(Ay) <1. Hence the system is un- 
stable, and A)y¢BD,. This proof relies on the fact that || A; ||, < 1 if 
and only if || A || < 1 (see Exercise 3.9). a 


Theorem 3.6 does not really provide a test for stability robustness, 
because we do not have a means for computing || Q32 ||,,. However, (3.143) 
provides the basis for some computable approximations to ||Q,, ||,,. Doyle 
(1982) shows that the first inequality in (3.143) can be strengthened to 


man pl) = #(Q) (3.145) 


but the maximization problem here is likely to lead to local maxima being 
found, and so is not reliable. In fact the second inequality in (3.143) is a more 
useful starting point, particularly as it is dangerous to underestimate u, but 
safe to overestimate it. 

For n <3 and m; = 1 (i=1,...,n)—that is, if there are no more than 
three blocks in A, with no repetitions— Doyle (1982) has shown that 


u(Q) = inf ¢(DQD~") (3.146) 


So, if we have up to three uncorrelated perturbations, then (3.146) holds. Note 
that the size of each block A, is not limited here. For n> 3, numerical 
experiments indicate that the lower and upper bounds in (3.143) are usually 
within 5% of each other, and almost always within 15%. Thus the evidence 
is that inf¢(DQD~') is a useful estimate of u, and it is therefore worth 
developing ways of computing, or at least estimating, this infimum. 
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The obvious approach is to use hill-climbing to find the D which 
minimizes g(DQD~') at each frequency. This is quite attractive, as 
g(DQD *) can be shown to be convex in D (Safonov and Doyle, 1984), and 
since the search is over only m — 1 parameters, regardless of the dimensions of 
the blocks in A. (The search is over d,,d,,...,d,,, one of which can be fixed 
arbitrarily.) Doyle (1982) shows how to generate descent directions for such 
hill-climbing; the algorithm is rather involved, and simpler alternatives are 
worth considering. The basis of a completely different algorithm for com- 
puting the structured singular value is given by Fan and Tits (1988). 

Safonov (1982) has given a simple way of estimating info(DQD~*) for 
k; = | (i= 1,...,n)—that is, when each perturbation is a scalar, but may be 
repeated. He has shown that, in this case, 


inf ¢(DQD~') < infé(abs(DOD~")) = 4(0) (3.147) 


where [abs(X)],; = |x,;| and Ap(.) denotes the Perron-Frobenius eigenvalue 
of a matrix (see Chapter 2). The D which minimizes 6 (abs (DQD~')) is given 
by 


D=diag{d;}, i=1,...,n (3.148) 
d; = V (y./x:) (3.149) 
where x =(x,,...,X,) and y=(y,,...,y,) are the right and left Perron- 


Frobenius eigenvectors of Q, respectively. Note, however, that the estimate 
(3.147) can be obtained without finding D. 
This result can be extended to the general case, which allows k; > 1, as 


follows. Let Q be partitioned conformally with the block diagonal structure 
of A: 


Q=]: (3.150) 


(which should not be confused with the partitioning of (3.106), as the Q there 
is different; Q in (3.150) corresponds to Q22 in (3.106)). Now define the matrix 


(Q11) --. F(Qim) 
norm (Q) = : : (3.151) 


(Qmii) Sie F(Qmm) 
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Then 


infs(DQD~*) < Ap(norm (Q)) (3.152) 
D 


where the infimum is over the matrices D with the diagonal structure defined 
in (3.141). However, this algorithm is of limited usefulness since a better 
estimate of || Q ||, is often provided simply by a(Q) (if k; > 1). 

Another way of estimating info(DQD~ 1), but for k; = 1 only, is to use 
the algorithm put forward by Osborne (1960), which finds 


D* = argmin)\o7(DQD ') (3.153) 
D i 
and to take ¢(D*QD*~') as an estimate. 

Another useful estimate has been obtained by Kouvaritakis and 
Latchman (1985), but for a different class of perturbations. Suppose that A 
does not have any particular diagonal structure, but that each element of Ais 
bounded by 

lô; (jo)| < r:(@) (3.154) 
where r;;(œ) > 0. If A is an mxm matrix, this uncertainty description can 
also be represented by a purely diagonal perturbation, but of dimensions 


m? x m?; the bounds (3.154) therefore define a structured perturbation. Let R 
be the positive matrix 


R=|: (3.155) 


Now let x and y be the right and left Perron—Frobenius eigenvectors of 
Rabs(Q), respectively, and let u and v be the right and left Perron—Frobenius 
eigenvectors of abs(Q)R, and define 
D, = diag {,/(yi/x;)}, i=1,...,m (3.156) 
D, = diag {./(u;/v;)}, i=1,...,m l (3.157) 
Now, for any positive diagonal D, and D3, we have (see Exercise 3.10) 
p(QA) < &(D,RD,)G(Dz* QD; ") ; (3.158) 
so that a sufficient condition for stability of the perturbed system is 


6(D,RD,)6(Dz!QD;"')<1 (3.159) 
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Now suppose that the uncertainty description is rewritten in terms of a 
diagonal perturbation A, so that the corresponding compensated plant 


transfer function is Q, with scaling introduced so that | 5:;| <1 
(see Exercise 3.10). Then (3.159) implies that 


is 6(D,RD,) 
)) s — (3.160) 
mQ) a(D,Q +D) 
and since this holds for any positive diagonal D, and D,, we have 
5 a(D,R 
Oem ee G.161) 


Di, D2 a(D,Q~*D,) 


(3.156) and (3.157) give D, and D, which approximately achieve the infimum. 
A tighter estimate of u(Ô) can be obtained by finding D; and D, which 
actually do achieve it, but this requires a more elaborate algorithm based on 
hill-climbing. Such an algorithm is described by Kouvaritakis and Latchman 
(1985), and Daniel et al. (1986). 


3.11.3 Loop failures and gain variations 


An important property of a multivariable feedback system is its stability 
robustness in the face of one or more loop failures. This is sometimes called 
the integrity of the system. With a SISO system the position is straight- 
forward: if the feedback loop is broken because of some equipment failure, 
then stability is determined by the open-loop stability of the plant and 
compensator. But it is quite possible to have a multivariable plant and 
compensator, each of which is open-loop stable, and which together comprise 
a stable feedback system when everything is working correctly, but which will 
be unstable if one or more loops fail, perhaps as a result of a sensor or an 
actuator failing. 

This is an important special case of the more general question of 
stability robustness in the face of frequency-independent gain variations 
anywhere in the system. Clearly, such questions can be investigated by 
the methods presented in Section 3.11.2, with A a diagonal frequency- 
independent matrix. These methods are likely to be unnecessarily conserva- 
tive, however, since they will check for robustness in the face of complex 
‘gains’ as well as real ones, and in the face of gain inereases as well as 
decreases. 

An exact assessment of integrity against loop failures can of course be 
obtained by examining the characteristic loci for each possible failure mode, 
but if there are m loops, and any combination of these can fail, then there are 


122 PERFORMANCE AND ROBUSTNESS OF MULTIVARIABLE FEEDBACK SYSTEMS 


displays to be checked. This is feasible, although tedious, for small numbers of 
loops (six displays if m = 3, fourteen displays if m = 4), but gives no informa- 
tion about the effects of variations of gains between their nominal values and 
zero. Such variations may occur if equipment fails gradually rather than 
abruptly. 

A better, albeit conservative method of assessment is obtained by 
using a Nyquist array with generalized Gershgorin bands. As usual, let us 
represent gain variations in the form of Figure 3.9, with the uncertainty 
described by 


A — diag 1815095: . Ôm} (3.162) 


each 6; being real. Now consider the Nyquist array of Q,, (with Q partitioned 
as in (3.106)), with its Gershgorin bands superimposed. Since Q,, is itself 
stable, Theorem 2.13 tells us that the feedback loop will remain stable 
whenever the total number of encirclements of the points {1/6;}, each by the 
ith Gershgorin band, is zero. (Note that we look at 1/6;, not — 1/ô;, since in 
Figure 3.9 we have used a positive-feedback convention.) Usually this will be 
the set of values of ô; for which none of the points {1/6,} is encircled by the ith 
Gershgorin band. The conservativeness of this assessment can be minimized 
by applying various similarity transformations to Q,,, including the Mees 
scaling described in Section 2.10. It may also be reduced by choosing another 
representation of the gain variations: for example, the additive representation 
k; = ky; + 6; may allow a larger range of values of k; than the multiplicative 
representation k; = ko;(1+6;), since Q,, will be different in each case. 


EXAMPLE 3.3 


A two-input plant G(s) is placed in a negative-feedback loop with a series 
compensator C(s). The two input actuators are modelled by frequency- 
independent gains kj, and ky. Is the system robustly stable against 
the failure of either loop? Modelling the actual actuator gains by 
k; = ko;(1 + ôĉ;) leads to the arrangement shown in Figure 3.15, with 


Ox. = —diag {ko1,ko2} CG[I + diag {koi koz} CG]! (3.163) 


Nominal actuator gains correspond to 6; = 0, while complete actuator 
failures correspond to 6; = — 1. Ifthe generalized Gershgorin bands of Q23 
are as shown in Figure 3.16 (drawn for positive frequencies only), we can 
deduce that the system remains stable whenever 


1 1 1 
(-2 <5 < -09 or a i 0.1 or => 08) 


and (-0<5-<-2 or E PER or 5 > 015 | 
82 92 82 (3.164) 
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Figure 3.16 Generalized Gershgorin bands of Q,,, as defined by equation 
(3.163), for the system shown in Figure 3.15. 


or, more simply, whenever 
(-1.11<6,<1.25 or —10 < ô < — 1.25) 


and (—0.5 <ô, < 1.33 or ô,< —1.25) (3.165) 


This system therefore remains stable if actuator 1 fails, but cannot be 
guaranteed to remain stable if the gain of actuator 2 falls below 0.5 kọ2. - 
Note that (3.165) gives information about simultaneous gain variations: 
we can deduce, for example, that stability is maintained if k, = 0 and 
k, = 0.6 kọ2. 
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A different approach to robustness in the face of real parameter 
variations has been developed by Battacharyya (1987). 


3.12 Performance robustness 


In the previous section we were concerned with the robustness of stability in 
the face of various perturbations. In practice we are usually concerned with 
maintaining more than stability. We would like to maintain an acceptable 
level of performance, as well as stability, in the face of a prespecified class of 
perturbations—that is, we would like to achieve performance robustness as 
well as stability robustness. Fortunately, we have already developed all the 
tools we need to formulate this problem precisely and assess whether it has 
been achieved by a given design. 

In Section 3.9 we saw that it is possible to formulate frequency- 
domain performance specifications in terms of the operator norm ||. || ,. 
Using out standard representation of uncertainty, shown in Figure 3.9, we 
partition the transfer function Q so that 


yY] vl Qi Qiz v 
j-e l-le oe lle oe 


and define the variables v and y such that 
IQirllo<t (3.167) 


becomes the performance specification. 


EXAMPLE 3.4 


Suppose that the performance specification is 


WS 
WT 


<1 (3.168) 


ao 


Figure 3.17 Definition of the variables v and yin equation (3.166) so that the 
specification (3.168) can be expressed as || Q,, ||. <1. 
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where S is the sensitivity function, T is the closed-loop transfer function 
(T =1—S),and W, and W, are frequency-dependent weighting matrices. 
Then v and y = [y, y2]" are defined as in Figure 3.17, in which G and K 
are the perturbed plant and compensator, respectively. 


With the perturbation A present, as shown in Figure 3.9, the relation- 
ship between v and y becomes’ 


y=[04,+Q:2.A(I—Q22A)*Qai de (3.169) 
so that the criteria for performance robustness can be stated precisely as: 


| Qi +Q12A(1— Q224) Qz Ila <1 (3.170) 
and 


| Qo2Il,< t (3.171) 


where A =diag{A,,...,A,} and |All <1, and (3.171) comes from 
Theorem 3.6. 

Suppose that, instead of imposing a performance specification, we 
were to add an additional uncertainty block Ay between y and v, with 
\| Ag lo < 1, as shown in Figure 3.18, and require stability robustness in the 
face of any possible perturbation, including the fictitious Ay. That is, suppose 
that we required 


|@ll,<1 (3.172) 


where ||. ||, is now computed with respect to the structure diag {AoA} = 
diag {A ,A,,...,A,}. Then, since stability would be required for perturba- 
tions diag {A,,0} and diag {0, A}, (3.172) would certainly imply that both 


Figure 3.18 Replacement of a performance specification by a fictitious un- 
certainty Ao. 
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| Q11 lo <1 and || Q.. ||, < 1 were satisfied. Rather remarkably, it can be 
shown that (3.172) also implies that (3.170) is satisfied for all permissible A; 
moreover, (3.172) is actually a necessary condition for (3.170) to hold. 

This result is of great importance in the analysis of feedback systems, 
and opens the way to systematic methods of synthesis, as we shall see in 
Chapter 6. 


Theorem 3.7 (Performance robustness theorem — Doyle et al., 1982): 
The condition (3.170) holds for all Ae BD, and (3.171) is satisfied 
(computed with respect to the structure of A) if and only if 


lQi,<1 

where 
= 014 A 3.173 
s e On, oe 


and | Q ||, is computed with respect to the structure of diag {A,, A} 
(provided Q is stable). 


Proof: 
I Qi. =supa(Q(jo)) (by definition) 
Now, 
u(Q(jw)) <1 (3.174) 
if and only if 
det (I — Q(jw) A) > 0 (3.175) 


for all œ (by definition of u(.)), and for all A = diag {A,,A}, with 
lâol < 1 and AeEBD,. Now, 


I— Qi: âo aed (3.176) 


det(J-—QA)= aer ahh, £208 


= det (I — Q,,A)det [(1— Q11 Ao) 


—Qı2A(I— Q224) ' Q240] 
(by Schur’s formula) (3.177) 


= det (7 — Qz: A)det (1 — [Q1 +0124 
x(I— Q224) +Q21]40) (3.178) 
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Since (3.178) must hold for A = 0 and Ay = 0, we deduce that (3.175) 
holds if and only if 


det(I—Q,,A) >0 (3.179) 


and 


det (J —[Q1, + Q,.A(I—Q2,A)"*Q2,]Ao) >0 (3.180) 
for all permissible A and Ay. But (3.179) is equivalent to 


lQ22l <1 


and, using the same argument as in Section 3.11.1, (3.180) is equivalent 


to 


| Q11 + Q12 AU — 24) t Qa: 1a <1 


EXAMPLE 3.5 (Doyle et al., 1982) 


We are given a nominal plant 


Go (s) -S J (3.181) 
with unstructured multiplicative input uncertainty, so that the true plant is 

G(s) = Go(s)(I + Aj) (3.182) 
where 

a(A) <|- n | (3.183) 


for some constant a. The performance specification is 


joa 


G(S(ja@)) < a (3.184) 


where S=(I+GKY +. Note that increasing « has the effect of making 
performance robustness easier to achieve. 

We wish to determine the values of « for which the specification is 
achieved in the face of uncertainty, with the particular compensator design 


0118 1 
K= 
| 1 id ei) 
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Figure 3.19 Block diagram corresponding to Example (Doyle et al., 1982). 


To do this we first reformulate (3.183) and (3.184) as 


All. = I <i ; 
lâle = frä, (3.186) 
ie 
pam ee (3.187) 
jouw” I, 


and obtain Q corresponding to Figure 3.9. The detailed block diagram for 
this case is shown in Figure 3.19, from which we obtain 


= O11 O12 
2 Fe | 


4 1 
TPG RY a PAG Ke Gs 
aS as 
1+s ʻi 1+s -1 
(3.188) 
Now we have 
u(Q) = infé(DQD~') (3.189) 
D 


where 


D= | Oa zl (3.190) 
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1 w 
0.01 0.1 l 10 100 


Figure 3.20 Structured singular value of Q with «= 1 (see text). 


the equality holding in (3.189) since D has only two blocks, and (3.189) 
simplifies, in this case, to 


u(Q) = inf 7 qQi2 (3.191) 
zz Q22 


where d is a positive real scalar. A plot of u(Q) with « = 1 is shown in 
Figure 3.20, from which we find || Q ||,, = 18.5. 

Since |Q||,, scales inversely with a, the compensator (3.185) 
achieves performance robustness provided « > 18.5. 


SUMMARY 

In this chapter we have seen that the all-important notion of the gain of 
a feedback system can be extended to multivariable systems. For multi- 
variable systems this gain is a range of values, each corresponding to a 
particular input signal direction. The lower and upper boundaries of this 
range are given by the smallest and largest frequency-dependent singular 
values, or principal gains. Analysis of feedback systems often requires the 
inspection of graphs of these principal gains, which look just like the 
familiar Bode plots used for SISO design. 

We have seen that Bode’s gain-phase relations continue to hold for 
multivariable systems, and that the same broad conclusions can be drawn 
about gain characteristics near the cross-over frequency, and about the 
effects of right half-plane zeros and poles, as for SISO systems. 
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To examine the robustness of multivariable designs we have intro- 
duced a new representation of plant uncertainty which is powerful enough 
to model a wide variety of both structured and unstructured perturbations. 
By exploiting the flexibility of this representation, we found that we could 
examine a wide variety of robustness questions by borrowing only one idea 
from traditional analysis—that of gain margin. 

‘The use of principal gains, or operator norms of suitably weighted 
transfer functions, allows the specification of both performance and robust- 
ness requirements to take the same mathematical form, and this leads to a 
very powerful result: performance specifications are, in a precise sense, 
interchangeable with uncertainty specifications, and the same tools can be 
used to assess the attainment of both types of specification. This begins to 
explain the well-known fact that systems have to be ‘detuned’ if their 
behaviour is very uncertain. 

If the approach taken in Sections 3.10 and 3.12 is adopted, then all 
problems of feedback analysis become multivariable. There is always some 
performance specification, and some description of uncertainty, so that Qis 
at least a 2 x 2 matrix, even if both the plant and the compensator are SISO. 
This does not mean, of course, that multivariable techniques should neces- 
sarily be applied to design. 


Notes 


1. The term ‘principal gain’ was introduced by MacFarlane and Scott-Jones 
(1979). It has not become well-established, and principal gains are often 
referred to merely as ‘singular values’. But because of the possible confusion 
with the singular values of the matrix A of a state-space realization, which are 
not frequency-dependent, it seems preferable to make the distinction, just as it 
is useful to use the term ‘characteristic loci’ rather than ‘eigenvalues’. 


2. Orthonormal columns can always be obtained by means of the QR factoriz- 
ation, described in Section 8.2.3. 
3. The identity /—(1+X)~!=(1+ X~')~? occurs frequently in the analysis 


of feedback systems, and it should be noted that it also equals X (I+ X)~ lit 
can be verified with a little algebra, but can also be obtained as a special case 
of the matrix-inversion lemma 


(A+ BCD)~! = A~!— A~!B(DA~!B+C7—1)"!DA“! 


which is useful in systems theory, optimal control and Kalman filtering 
theory. See Kailath (1980)—where it is called the ‘modified matrices 
formula’ — for references, and Åström and Wittenmark (1984) for a derivation. 


4. The standard block-diagonal representation of structured uncertainty de- 
scribed in this section was introduced by Doyle, two of the earliest references 
being Doyle et al. (1982) and Doyle (1982). 
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The design technique developed by Horowitz has recently acquired the name 
‘quantitative feedback theory’ (QFT). It is described in Chapter 4. 


The notation here follows Doyle (1982), who also gives proofs of some of the 
results presented in this section. It is possible for each A; to be non-square, but 
we do not treat this case. 


In the notation introduced in Section 6.1, we can write the inequality (3.170) 
as || Fi(Q,A) llo < 1. 


Using Lemma 3.2, verify that if ¢(Q)«1 then 6[I1—(1+Q)"?+] 
~ (Q). 


Suppose that G and K are both square and invertible. By writing 
GK = G(KG)G~' show that: 


(a) (GK) < 6(KG)cond(G) 
b) o(GK)> = 
and establish corresponding bounds on o(KG) and &(KG). 


Consider a plant 


and a controller 


K= ie 0 | 
kzı ky, 
Show that choosing kz; = —k,,g makes the return ratio GK normal 
(in fact, diagonal). Suppose that k,, = 1 and nominally g = 103, so 


that kı = — 10°. Show that a 1% change in g can make the return 
ratio extremely skew. What is the condition number of G in this case? 


(a) Express || G ||, in terms of the principal gains of G. 

(b) Suppose that {u(t)} has: power spectral density ®,,,(q@), that 
y(s) = G(s)u(s) and that G(s) is stable. Obtain an expression 
for E{y’Wy! in terms of the norm |. ||2, assuming that 
W'=We0. 
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3.5 


3.6 


3.7 
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A multivariable-feedback system is to attenuate all output disturb- 
ances by at least a factor of 10 at frequencies below 0.1 rads~', and to 
attenuate output sensor noise by at least a factor of 10 at frequencies 
above 2rads~!. Constant output disturbances should be attenuated 
(in the steady-state) by a factor of at least 100. 


(a) 
(b) 
(c) 
(d) 


(e) 


Formulate suitable specifications on the closed-loop transfer 
function T and sensitivity S, in terms of their principal gains. 
Obtain a corresponding specification on the open-loop 
return ratio GK. 

Reformulate the closed-loop specifications using the 
operator norm ||. || and suitable weighting functions. 

From (b), estimate the cross-over frequency, the maximum 
‘phase margin’ which could be exhibited by a characteristic 
locus, and hence a lower bound for || T ||- 


(Solution: œ, ~0.45rads~', max. ‘phase margin’ =~ 36°, 
|| T ||. = 1.6 (approximately)) 

Is the lower bound obtained in (d) consistent with the 
specification you obtained in (c)? 


Consider the aircraft model AIRC given in the Appendix. Suppose 
that a,, = 1.132+5% and b4, = 4.419 + 10%, and that all other 
parameters are known exactly. 


(a) 


Using the controller designed in Section 4.4, obtain a 
representation of the closed-loop system in the form of 
Figure 3.9. 

Examine the stability robustness of the system, by 
computing ||Q9 | «- 

Examine the stability robustness by estimating ||Q,.||,. 
Examine the stability robustness by the use of a Nyquist array. 


Find the transfer-function matrix of the plant model P(s) 
shown in Figure 3.12. 

In Figure 3.12, y, and y, are the controlled variables. These are 
fed back through conventional proportional-plus-integral 
compensators, each with transfer function 


kı + skp 
s 
to the flow control valWes (see Section 3.10.1 and 3.10.2). In 
order to investigate the effect of a loop failure, one of the 


compensators is assumed to have a multiplicative uncertainty 
of 100% in each of its parameter values. Draw a detailed block 


3.8 


3.9 


3.10 


(c) 


(b) 


(a) 


(a) 


(b) 


(c) 
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diagram which includes this uncertainty, in the form shown in 
Figure 3.9. . 
Does (b) describe a sensible way of investigating loop failure? 


Consider the feedback loop shown in Figure 3.13, with A, = 0. 
For each of the possible descriptions of plant uncertainty 
which appear in equations (3.81), (3.82), (3.90) and (3.121), draw 
the diagram corresponding to Figure 3.14, find expressions for 
Q11, Qi2, Q21 and Q,, (replacing those which appear in 
(3.107)—(3.110)), and hence find a necessary and sufficient con- 
dition for robust stability, assuming only that A is stable and 
that ||A||,, < 1 in each case, where A= W AW,. 

Repeat (a) for simultaneous input and output uncertainty, as 
modelled by (3.88), and show that this is in fact an instance of 
structured uncertainty. 


Using the notation defined in Section 3.11.2, show that 
DAD! =A 


Prove the properties of the structured singular value u(.) 
stated in (3.135}1{3.140) and (3.142}H13.143). 

Show that Ae BD, (see Section 3.11.2 definitions), if and only if 
lA llo < 1. (This is needed for the proof of Theorem 3.6.) 


If A has elements {ô;;} (|6,;| <7,;) and {r,;} are the elements 
of R, then o(A) < a(abs(A)) < F(R). Show that, if D, and D, 
are diagonal, real matrices with positive elements, then 
o(D,AD,) < E(D, RD). Hence show that 


p(QA) < E(D, RD,)a(Dz' QD; *) 
Given a ‘full’ feedback perturbation A about a compensated 
plant Q, with element-by-element bounds lő; < rij, show 
explicitly how to obtain an equivalent diagonal perturbation 


A, and the corresponding compensated plant transfer function 
Q, with the bounds |6,;| < 1. 


e tai 3+ji 
1+j2  0.9-j0.21 


R= 0.5 1 
0.65 0.55 


Given 


and 
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(Kouvaritakis and Latchman, 1985), find the corresponding Ọ, 
as defined in (b), and estimate u (Q) by using the methods of 
Safonov and of Kouvaritakis and Latchman. 


(Solution: Safonov, 5.081; Kouvaritakis and Latchman, 4.732) 


3.11 Estimate (Q) for Example 3.5 by using equation (3.152), and com- 
pare your estimate with Figure 3.20. 


Repeat the assessment of performance robustness for the 


compensator 


10(s+ 1) 
Bee f =] 3s(s+ 16) > og 
2 3 2 _3 
3 4 0 9(16s + 1) 3 4 


32s(s+ 1) 


This compensator design is taken from Doyle et al. (1982). 


(Solution: Performance robustness achieved for « > 5.5) 


3.12 (a) 


(b) 


When a sampled-data system is designed using continuous- 
time techniques, the dynamic effects of a zero-order hold are 
often approximated by the transfer function 


1 
1+4sT 


(Franklin et al, 1986). Show that, if the sampling process is 
considered as a perturbation of the plant model, then the 
techniques described in this chapter can be used to select a 
sampling rate, assuming that a continuous-time design has 


already been performed. 
What are the limitations of this approach? 


A plant and compensator are given by 


T _ 0.81(s+0.2) 
g(s) =, 52’ k(s) = s+ 2 


(Franklin et al., 1986). The design specification is that 


1.25 
s? + 1.26s + 0.81 


= 
S=jo 


1+9(s)k(s) 


s=jo 


| g(s)k(s) 


If the compensator is to be implemented digitally, estimate the 
lowest acceptable sampling rate. Obtain digital equivalents of 
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k(s) and g(s) (see Franklin and Powell, 1980, or Franklin et al., 
1986, for methods of doing this) with your chosen sample rate, 
and check whether the specification is met (with frequency 
responses evaluated on the unit circle). 


————— aeaea a 
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Multivariable Design: 
Nyquist-like 
Techniques 
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4.1 Introduction 


In this chapter we look at those design techniques that arise as gen- 
eralizations of the classical SISO techniques reviewed in Chapter 1. When 
faced with a multivariable feedback design problem, one should certainly 
begin by trying one or more of the methods described here. They can often be 
made to work with relatively little effort, and they have the advantage that 
the designer has more control over the details of the control law than is 
possible with the methods introduced in Chapters 5 and 6. They can also be 
understood more easily by those familiar with the classical methods. 

In this chapter the design techniques are arranged in an order which is 
governed by convenience of presentation, rather than the order in which they 
should be attempted on a design problem. A personal view of the order in 
which they should be attempted is the following. Try sequential loop closing 
first, since this is the simplest thing to do (but it is also the least powerful, so 
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do not be dismayed if it fails). Then try Nyquist-array methods. These allow 
you to work in the real-coordinate frame of the problem, which in turn allows 
the understanding of any special features of the plant to be exploited in the 
design. Finally, turn to the characteristic-locus method. This is very powerful, 
and less conservative than the Nyquist-array approach, but forces you to 
work in the eigenframe of the plant, which usually makes it much more 
difficult to relate observed characteristics to the physical behaviour of the 
plant. 

If none of these approaches has been successful, then it will be 
necessary to move into deeper water. The reversed-frame normalization 
method may be helpful, but there have not been many successful uses of this 
to date, and it should probably be regarded as being mainly of theoretical 
interest. Quantitative feedback theory is a much more proven technique, but 
although it is (briefly) described in this chapter, its complexity is comparable 
to that of the techniques presented in later chapters. 


4.2 Sequential loop closing 


The simplest approach to multivariable design is to ignore its multivariable 
nature. A SISO controller is designed for one pair of input and output 
variables. When this design has been successfully completed, another SISO 
controller is designed for a second pair of variables, and so on. Figure 4.1 
illustrates the procedure. When designing each controller, the effects of those 
controllers around which feedback loops have already been closed should of 
course be taken into account. This is very easy to do, using a computer-aided 
design facility, since any relevant objects such as Nyquist loci can be readily 
computed and displayed. In fact, the Nyquist loci of all the remaining open 
loops can be displayed, and this may help in choosing which loop to tackle 
next. 


Figure 4.1 Sequential loop closing. 
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G(s) 


Figure 4.2 Sequential return difference. 


Such a sequential design procedure is often adopted in practice. It is 
appropriate when the controller structure is constrained to be diagonal- that 
is, to be such that no interactions between loops can take place. (The transfer- 
function matrix of such a controller is diagonal.) It also has the advantage 
that it can be implemented by closing one loop at a time, since the design 
procedure ensures that the system remains stable at each step. This is an 
important consideration in process control. If a fully cross-coupled controller 
is used, all feedback loops must be closed simultaneously; if only some of 
them are closed, there is no guarantee that the system will be stable. 

A sequential procedure, though, suffers from a number of drawbacks. 
It allows only a very limited class of controllers to be designed, and the design 
must proceed in a very ad hoc manner. Design decisions made when closing 
the first one or two loops may have deleterious effects on the behaviour of the 
remaining loops. And the only means available for the reduction of inter- 
action (if this is a requirement) is to use high loop gains. 

If we had available powerful methods of deciding how input and 
output variables should be paired (the loop-assignment problem) then sequen- 
tial loop closing would have a lot to recommend it. But unfortunately this 
aspect of multivariable design is relatively undeveloped (it is discussed in 
Section 4.10). 

A more sophisticated version of sequential loop closing, proposed by 
Mayne (1973, 1979), is called the sequential return-difference method. Mayne 
suggests that, before starting to design the individual loop compensators, a 
cross-coupling stage of compensation should be introduced. This stage 
should consist of either a constant-gain matrix or a sequence of elementary 
operations (scaled by a common denominator, to obtain a realizable 
compensator — see Section 2.2), and its purpose is to redistribute the “diffi- 
culty of control’ among the loops. Figure 4.2 shows the structure of the 
resulting controller; K,(s) = U(s)/d(s), where U(s) is unimodular (that is, 
det U(s) = const.), and so introduces no Zeros. 
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EXAMPLE 4.1 (Mayne, 1979) 


Suppose that 


o 1—s 4-s 
69)= P| 35 2 “D 


If we try to design a SISO controller for either the first or second 
loop here, we have difficulties, if the required bandwidth is close to unity, 
or greater, because the transfer function ‘seen’ for the design, namely the 
(1, 1) or (2, 2) element of G(s), has a zero at +1. However, G(s) itself has a 
transmission zero at — 1 only, so there should be no inherent difficulty of 
this kind. If we choose 


10 
oo (4.2) 


_ fin | tts 4-s 
Q(s) = G(s)K, = al 1 | (4.3) 


S 


then 


and we see that no right half-plane zero ‘appears’ when a SISO 
compensator k; (s) is being designed for the first loop. Once the first loop 


Figure 4.3 Sequential return-difference controller: loop 1 already closed, loop 
‘Z still open. 


SEQUENTIAL LOOP CLOSING 141 


has been closed, the transfer function ‘seen’ in the second loop is 


ross eR a (4.4) 
12219) = F H HIY 
(see Figure 4.3), where 
—k,(s) 
h(s) = 45 
O =T T+ ost DK) oe 
Now, if we assume that 
(s+1)? 
| ky (s) | > 4 +s 
(high gain in the first loop) then 
l (s+1)? 
Pee 4.6 
h(s) % (46) 
and hence 
1 
932 (s) &¥ ———____ (4.7) 


(3s+1)(s+1) 


so that no right half-plane zero is seen when the compensator for the 
second loop is designed. 
en entaneeeeeneenesen 

This example shows both the main idea and the main weakness of the 
sequential return-difference approach. The main idea, that of using a first 
Stage of compensation to make subsequent loop compensation easier, is 
potentially useful. But the main weakness of the method is that little help is 
available for choosing that first stage of compensation. The available analysis 
relies on the assumption that there are high gains in the loops which have 
already been closed (as in the example), and such an assumption can rarely be 
justified, except at low frequencies. 

One rather special case, in which the assumption of high gains is 
justified, arises when a different bandwidth is required for each loop, and all 
the bandwidths are well separated from each other. In such a case, if one 
starts with the fastest (highest-bandwidth) loop, and then designs the second- 
fastest, and so on, then the assumption of high gain is justified at each step. 
Suppose, for example, that there are two loops to be closed, with required 
bandwidths of about 100 and 10rads~!, respectively. Once. the faster loop 
has been designed, the design of the 10rads~! loop is undertaken, on the 
assumption that the 100rads~! loop has been closed. The cross- 
over frequency for the second loop should be somewhere between 5 and 
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10 rad s~‘, and over this frequency range the (open-loop) gain of the first loop 
is likely to be at least 10. Such a separation of bandwidths arises when there 
are large differences in the possible rates of change of the output variables; for 
example, a gas pressure can usually be changed much more quickly than a 
liquid level. 

Mayne (1979) suggests that, if the plant has a state-space realization 
(A, B, C), the product CB being non-singular (and the matrix D being zero), 
then the first stage of compensation can be chosen to be 


K, = (CB)! (4.8) 


The idea behind this is that 
G(s) -2 as |s| > 00 (4.9) 
(see Section 2.6) and hence 
I 
G(s)K, me as |s| > 00 (4.10) 


so that each loop looks like a first-order SISO system at high frequencies. The 
problem now is that the approximation (4.9) is likely to hold only at 
frequencies well beyond those at which loop compensation is required.’ 
However, an alternative choice, such as 


K, = G7! (jo) (4.11) 
or 

K, © jo, G7! (jo) (4.12) 
may be helpful, where œ, is the required bandwidth (assuming that a single 
bandwidth can be specified for the whole system). It will be shown in the next 
section that a similar first stage of compensation is used in the characteristic- 


locus design method; a method of performing the approximation required by 
(4.11) or (4.12) will also be given. 


4.3 The characteristic-locus method 


4.3.1 Approximate commutative compensators 


The basic idea behind approximate commutative compensators (MacFarlane 
and Kouvaritakis, 1977) is to manipulate the characteristic loci as if they were 


THE CHARACTERISTIC-LOCUS METHOD 143 


ordinary Nyquist loci. Let us concentrate first on how this can be done, and 
then on why it should be useful to do it. 

Suppose we have a square transfer-function matrix G(s), with m inputs 
and outputs, which has a spectral decomposition 


G(s) = W(s)A(s)W7*(s) (4.13) 


where W(s) is a matrix whose columns are the eigenvectors, or characteristic 
directions, of G(s), and 


A(s) = diag {/,(s),A2(s),. . .,Am(s)} (4.14) 


where the ,(s) are the eigenvalues, or characteristic functions, of G(s). If the 
compensator K(s) is given the structure 


K(s) = W(s)M(s)W7 '(s) (4.15) 
where 
M(s) = diag {4 (s), 2 (s),. G -s Hm (S) } (4.16) 


then the return ratio is 


—G(s)K(s)= —W(s)A(s)M(s)W~*(s) (4.17) 
= —W(s)N(s)W*(s) (4.18) 
where 
N(s) = diag {v,(s), v2(s),. - -,Vm(S) } (4.19) 
and 
vi(s) = 4;(s)u;(s) (4.20) 


So we see that if the plant and the compensator share the same 
eigenvectors, then the system obtained by connecting them in series has 
eigenvalues which are simply the products of the plant and compensator 
eigenvalues. 

The strategy which suggests itself is to obtain graphical displays of the 
characteristic loci of the plant, {/;(jw): i=1,...,m}, and to design a 
‘compensator’ y;(j@) for each A;(jw), using the well-established single-loop 
techniques. One would then obtain the compensator as the series connection 
of three systems, corresponding to W` t(s), M(s) and W(s). 
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It is easily checked that, if K(s) has the structure specified in equation 
(4.15), then G(s) and K(s) commute, namely 


G(s)K(s) = K(s)G(s) (4.21) 


A compensator having this structure is called a ‘commutative’ compensator. 
(Also, this is why we were able to refer earlier to ‘the return ratio’ rather than 
‘the return ratio at the output of the plant’.) 

Unfortunately, it is quite impractical to attempt to build a com- 
mutative compensator. The problem is that the elements of the matrices W(s) 
and W`! (s) are almost always irrational functions, which have no practical 
realizations. 


EXAMPLE 4.2 


The transfer-function matrix 


1 2 
s+1 s+! 
G(s)= 
2 1 
s+2 s+2 


has characteristic functions 
1 
(s+ 1)(s +2) 


The characteristic directions are given by 


wils) Z a| ~s—1 + V(16s* + 485 + 33) | 
W;(s) s+2 


dy, o(s) = [2s +3 + ./(16s? + 48s + 33)] 


A practical alternative is to give the compensator the structure 
K(s) = A(s)M(s)B(s) (4.22) 
where A(s) and B(s) are chosen to be realizable, and such that 


A(s) = W(s) (4.23) 


and 
B(s) x W+ (s) (4.24) 


(see Figure 4.4). 
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=f 
I 


B(s) W(s) = W`’ (s) A(s) =I 


Figure 4.4 The approximate commutative compensator. 


This gives rise to a potentially infinite family of techniques, depending 
on the algorithm chosen for arriving at the approximations A(s} and B(s). In 
general, one could envisage using rational approximation theory to arrive at 
rational matrices A(s) and B(s) which could be realized as dynamic systems. 
Whichever approximation technique is chosen, we obtain an approximate 
commutative compensator, although in the literature only the simplest case 
has been investigated, namely that which arises when A(s) and B(s) are 
constant matrices, and can therefore be realized by networks of amplifiers, 
without any dynamic elements. 

One way of choosing constant matrices A and B is as approximations 
to W(s)) and W ~*(sy) at some particular point sọ. Usually one would choose 
So =j@y for some (real) wọ, so W(so) and W~1(so) are usually complex 
matrices, which need to be approximated by real matrices. 

A successful algorithm for computing constant real matrices for use as 
the matrices A and B is the ALIGN algorithm (Kouvaritakis, 1974). This is 
based on the observation that a commutative compensator is obtained at a 
point sọ even if A is not exactly equal to W (so); it is sufficient that the columns 
of A have the same ‘directions’ as the columns of W (sọ), namely that they are 
scalar multiples of the columns of W(s,). Writing 


A = (4,,43,... 5m) (4.25) 


and 

W (So) = (Wy, Wo, --- 5 Wm) (4.26) 
we obtain a commutative compensator at sọ if 

a; =w,z;, t=1,2,...,m (4.27) 
for some scalar complex numbers z;, and 

B= A~! (4.28) 


If we now define 


V(s) = W~1(s) (4.29) 
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and write 

Spy Sos + Vm) (4.30) 
then 

vya,=0, iŻj (4.31) 


if equation (4.27) holds. Consequently, the ALIGN algorithm chooses the 
columns of A according to: 


a; = arg max ————— (4.32) 


“o S oal? 
j#i 


This maximization problem can be solved by solving a generalized 
eigenvalue problem, as shown by MacFarlane and Kouvaritakis (1977). Let 
us write the function being maximized in (4.32) as 


a! Xa, 
ye ek AOE 4.33 
Ja) = rq (4.33) 
where 
X = vo” (4.34) 
and 
Y= ¥ vof (4.35) 
jti 


Since X and Y are Hermitian (X" = X and Y” = Y), we can replace (4.33) by 


a} Ca; 
Ja) =a (4.36) 


where C = Re{X} and D = Re{ Y}, and C and D are both symmetric and 
positive-semidefinite. 
Differentiating (4.36), we obtain 


ao a} Da, + 2J (a;)Da; = 2Ca; (4.37) 


which, setting 0J/da; = 0, gives 


J (a;)Da; = Ca; i (4.38) 
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which is a generalized eigenvalue problem (see Section 8.2.8). Thus the 
maximum value of J(a;) is obtained as the largest generalized eigenvalue 
of the pair (C, D), and a; is obtained as the corresponding generalized 
eigenvector. 

An equivalent formulation of the maximization problem (4.32) is as a 
least-squares problem (Edmunds and Kouvaritakis, 1979). If we observe that 
we can impose the constraint 


|a;| = || (4.39) 
without losing any generality, then we see that in (4.27) we must have 

z; = exp(jd;) (4.40) 
for some real 6;. It follows that we can write (if (4.27) holds) 

A = W(s,)diag {exp(j6;)} (4.41) 


which implies that 
V(So)A = diag {exp(jd;)} (4.42) 


The least-squares version of the ALIGN algorithm therefore chooses 
a; according to 


a; = argmin || V(so)a; — exp(jo,)e; || (4.43) 


aj, ĝi 


where e; denotes the ith standard basis vector, and the norm is defined by 
|x|? = x*x (4.44) 


Note that the solution of (4.43) is not unique, since the same value of 
the norm is obtained with the pair (—a,, ô+ n) as with (a;, 6;). For the 
purposes of alignment this does not matter. We shall soon see, however, that 
the ALIGN algorithm is very useful for obtaining real approximate inverses 
of complex matrices. When used for this purpose the sign of a; does matter, 
and it should be chosen so as to give the smaller value of || Va; — e;||. 

It can be shown that (4.32) and (4.43) are formally equivalent, 
although they naturally lead to alternative algorithms. 

Use of the approximate commutative compensator, with constant 
matrices A and B, clearly relies on the eigenvectors of the plant not changing 
too quickly with frequency, since the compensation would be useless if its 
effects were predictable at only one frequency. If the eigenvectors do change 
too quickly, one can try to choose A and B so as to approximate W(s) and 
W ~*(s) at several frequencies rather than one; this is achieved if one chooses 
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a; according to 


>; Py | UE (jora? 
a; = argmax —— (4.45) 


$ È Pr fz ef Goa} 


where {p,,..., Py} is a set of (real non-negative) weights which allow the 
approximation at some frequencies to be emphasized more than at others. If 
we write the function to be maximized as 


J(a,) = a (4.46) 
where 

X =X puljo) Go) (4.47) 
and 

Y= È Pi l Py vj(jo,) oF ( jod} (4.48) 


then we see that the solution is again obtained by choosing a; to be the 
generalized eigenvector which corresponds to the largest generalized eigen- 
value of the pair (Č, D), where Č = Re{ X} and D = Re{ F}. On the other 
hand, the corresponding formulation of the ‘multi-frequency’ problem as a 
least-squares problem does not have a simple solution. 

This refinement of the ALIGN algorithm frequently gives no better 
results than the basic version, which tries to match the eigenvectors at one 
frequency only. The reason seems to be that the price paid in not obtaining a 
good approximation to the eigenframe at any frequency often outweighs the 
advantage gained. If one attempts multi-frequency alignment with a dynamic 
compensator, however, one can obtain greatly improved frame alignment, as 
might be expected. We shall pursue this possibility further in Section 4.7.3. 

Henceforth we shall use ‘ALIGN’ to mean the single-frequency 
alignment algorithm, unless we explicitly state otherwise. 

If all the characteristic loci require shaping over approximately the 
same range of frequencies, then the choice of the frequency at which the 
ALIGN algorithm should be used does not present a major problem. But it 
often happens that different loci require compensation at different fre- 
quencies. One can then either choose one of these frequencies, and hope that 
approximate compensation is achieved on all loci satisfactorily, or compen- 
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sate each locus with a separate approximate commutative compensator: 
K(s)= K,(s)K,(s)... K,,(s) (4.49) 
where 
K,(s) = A;M,(s)B; (4.50) 


and K,(s) is designed to provide compensation at frequency j@;. The second 
alternative leads to a very complicated compensator, and suffers from the 
difficulty that each factor of the compensator (i.e. each K;(s}) interferes with 
all the others. 


4.3.2 Design procedure 


Now that we know how to manipulate the characteristic loci, let us consider 
why it should be useful to do so. One reason is, obviously, that it may allow 
us to obtain a stable design by satisfying the generalized Nyquist stability 
theorem. But merely attaining stability is rarely the objective of feedback 
design, and is certainly not the sole objective of the classical ‘locus-bending’ 
techniques. Do we gain anything if we do not merely achieve the required 
number of encirclements by the characteristic loci, but also ensure that the 
characteristic loci stay outside the M = ./2 circle, for example, or have large 
magnitudes at low frequencies? 

The answer is that we certainly need to give the characteristic loci 
features such as these, but that even if all the loci look satisfactory -when 
judged against classical SISO criteria, as if they were Nyquist loci—this may 
not ensure satisfactory performance of the multivariable feedback system. 


Suppose that the plant and compensator together have the spectral 
decomposition 


G(s) K(s) = W(s) diag {v,(s)} W ~1(s) (4.51) 
= ¥ wO (4.52) 


(using notation as in (4.26) and (4.30)). The sensitivity function is then 


S(s) = W(s) A i = oh W ~1(s) (4.53) 


1 
TG 


vi (s) (4.54) 
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Figure 4.5 Interpretation of the spectral decomposition of a multivariable com- 
pensated plant inside a feedback loop as a collection of SISO feedback loops. 


and the closed-loop transfer function is 


T(s) = W(s) diag i “at W -1(s) (4.55) 
Se an) (4.56) 
iE °° 1++,(s) : 


Expressions (4.54) and (4.56) lead to the following interpretation, 
which is illustrated by Figure 4.5. The input signal r(s) is ‘projected’ onto the 
ith characteristic direction w,(s). This projection results in the signal v/(s)r(s), 
which is the input to a (SISO) feedback system having the transfer function 
v,(s) in the forward path and unity negative feedback around it. The outputs 
of these m feedback systems are added together (vectorially) to give the 
system output y(s). The disturbance signal d(s) is similarly projected onto the 
characteristic directions, and each component of the disturbance enters 
the corresponding feedback loop in the usual way. 

It seems reasonable to expect that, if each of these m feedback systems 
is designed according to classical precepts (satisfactory speed of response, 
adequate rejection of disturbances and so on), then the whole multivariable 
feedback system will inherit these properties. Unfortunately, this is not 
always true. As we saw in Chapter 3, the performance of a feedback system 
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depends essentially on the principal gains of functions such as S(s) and T(s). 
Now, equations (4.53) and (4.55) show that these transfer functions have 
eigenvalue functions 


1 d f v,(s) } 
T+ O Yvo 


respectively, and we know that, for any matrix X, 


a(X) <|A(X)| < F(X) (4.57) 


where A(X) denotes any eigenvalue of X. We can therefore conclude that if 
the characteristic loci of GK have low magnitudes at low frequencies, then the 
sensitivity will certainly be large, in some signal directions, at those frequen- 
cies. And if some characteristic locus penetrates the M = 2 circle, then at 
least one principal gain of T(s) will exhibit a resonance peak, of magnitude 
greater than \/2. But we cannot draw the converse conclusions. 

Equations (4.53) and (4.55) also show that GK, S and T all share the 
same eigenvectors, and in Chapter 3 we saw that the magnitudes of the 
smallest and largest eigenvalues are close to the smallest and largest singular 
values if the eigenvectors are nearly orthogonal to each other. We can 
therefore expect that shaping the v,(jq) will lead to good system performance 
if the return ratio has low skewness (see Chapter 3). 

Note that the discrepancies between characteristic loci and principal 
gains do not make the shaping of characteristic loci a fruitless activity. On the 
contrary, manipulating the characteristic loci is often the most straight- 
forward and productive way of designing a multivariable feedback system, or 
at least of initiating the design. Of course, the properties of the resulting 
design must be checked by methods more revealing than examination of the 
characteristic loci. 

A frequent requirement for multivariable systems—one which does 
not arise for SISO systems — is that they should exhibit little interaction, that 
is, a change in one of the reference signals should cause only the correspond- 
ing output to change, without excessive transients occurring on the other 
outputs. Alternatively, the appearance of a sudden disturbance on one of the 
outputs should not disturb the other outputs excessively. (For systems with 
one degree of freedom, these statements are equivalent.) If we had ‘high gain’ 
everywhere in the sense that 


[v;(s)| > 1, for each i (4.58) 
then 


vs) 
I+ ~* aS 
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and so, from (4.55), we would have 
T(s)xI (4.60) 


This generalizes the familiar single-loop argument which shows the 
benefits of high loop gain, and demonstrates that lack of interaction is a 
benefit obtained in addition to the usual single-loop benefits. But we know 
that, beyond some frequency, the condition (4.58) cannot hold, both for 
physical reasons and because the generalized Nyquist theorem needs to be 
satisfied. Thus, at these higher frequencies, feedback cannot be relied upon to 
enforce low interaction: the only thing that can be done is to insert a series 
compensator whose main purpose is to obtain a ‘decoupled’ (that is, diag- 
onal) return ratio of — G(s)K(s) at (some of) these frequencies. One way of 
attempting to decouple the return ratio at one or several frequencies is to 
attempt to invert G(s) at these frequencies, and we already have an algorithm 
for doing this—the ALIGN algorithm. 

Suppose that we decide to use the ALIGN algorithm to obtain an 
approximate decoupling of the system at a particular frequency. How should 
that frequency be chosen? A closed-loop bandwidth œ, can be achieved (if the 
characteristic directions span the signal space well) by setting 


|v;(jo, )| 


1 
: = , for each i (4.61) 
|1 + v,(jo,)| 2 


which implies that |v,(ja,)| lies on the M =1/,/2 circle, as shown in 
Figure 4.6. From the figure it is clear that |v;(j@,)| cannot exceed 1 by much, 
and is usually less than 1, so high gain can no longer be relied upon to reduce 
interaction at the frequency w,. However, at frequencies much higher than œ, 
the ‘loop gains’ will usually be very low, so contributions to the output at 
these frequencies will be unimportant. This leads to the conclusion that ‘high- 
frequency decoupling’ should be attempted at or near @,. 

So the first step of the design procedure is to obtain a ‘high-frequency’ 
constant compensator K, ~ —G~‘(j@,). The negative sign is chosen be- 
cause this shifts the eigenvalues of G(ja,)K,, into the vicinity of — 1, rather 
than + 1. In the final design the characteristic loci at the frequency œ, should 
be closer to — 1 than + 1 (see Figure 4.6), so this choice is likely to make later 
design steps easier. One effect of the compensator K, is that it results in all the 
characteristic loci having similar values near the frequency «,, whereas the 
original plant’s loci may be very widely scattered at this frequency. This 
makes it easier to design an approximate commutative compensator in the 
next design step, since all the loci require compensation at similar frequencies. 

It may be advantageous to adjust the signs of the columns of K, at this 
stage. For example, some column of G(ja,)K, may be very far from a 
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Figure 4.6 A characteristic locus, with the M= V 2, M=1/ UE 2, and |S|=1 circles. 


column of a real unit matrix, in which case one of the characteristic loci will 
be very far from — 1 at frequency œ. In such a case changing the sign of the 
corresponding column of K, may bring this characteristic locus into a region 
in which subsequent compensation becomes easier. In particular, it may 
change the number of encirclements of —1 by the characteristic loci; it is 
usually easier to achieve the correct number of encirclements at this stage by 
means of sign changes, than at a later stage by means of dynamic compen- 
sation. Note that a change of sign does not just change the phase of one 
locus —in general it will alter the loci completely. 

Grosdidier et al. (1985) have shown that, if the compensator has the 
structure 


and G(s)C(s) is proper, then it is always possible to find a C(s) such that the 
closed-loop system is stable for a range of gains 


O<k<k* 


if all the eigenvalues of G(0)C(0) have positive real parts. Furthermore, it is 
impossible to find such a C(s) if any eigenvalue of G(0)C(0) has a negative real 
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part. This strongly suggests that the signs of the columns of K, should be 
adjusted to bring all the eigenvalues of G(0)K,, into the right half-plane, if 
possible. 

A suitable overall design procedure is the following, which has been 
called the characteristic-locus method: 


(1) | Compute a real Ky, ~ —G~*(ja,). 

(2) Design an approximate commutative controller K,,(s) at some fre- 
quency @,,<@,, for the compensated plant G(s)K,, such that 
K,,(j@) > I as œ > œ (the latter requirement is an attempt to ensure 
that the decoupling effected by K, is not disturbed too much). Ideally 
we should like K,,(j@) to approach I as œ > @,, but this is not a 
realistic goal. 


(3) If the low-frequency behaviour is unsatisfactory (typically because 
there are excessive steady-state errors), design an approximate com- 
mutative controller K,(s) at some frequency w, < @,,, for the compen- 
sated plant G(s)K,K,,(s), such that K,(j@) > I as œ > œ. 


(4) Realize the complete compensator as 
K(s) = K, Kn(s).Ki(5) (4.62) 


The subscripts h, m and | denote high, medium and low frequency, 
respectively. Usually, K,(s) is used to introduce integral action. 


Step 2, the medium-frequency compensation, is concerned with 
shaping the characteristic loci in the vicinity of — 1, and this is where phase 
lead or lag, or more complicated shaping, is applied to the loci. This means, 
however, that frequencies ,, and œ, are quite close together, and step 2 is 
likely to interfere with step 1. To reduce the interaction between these two 
steps it is often necessary to obtain K, at a frequency rather higher than @,. 
K,, can be designed at a frequency rather lower than the final 0 dB cross-over 
frequency of the loci, but the dynamic elements within it will inevitably have 
to exert their influence near this frequency. 

The characteristic-locus design technique is best suited to plants 
which are ‘uniform’ in the sense that it is feasible to obtain a similar 
bandwidth for each loop with reasonable control signals. If a plant is not 
uniform in this sense, for example because it has two outputs which can 
respond very quickly and two which can respond only rather slowly, then it 
may be possible to perform two (or more) characteristic-locus designs: the 
first one would usually be for the fast loops, and the second for the slower 
loops, the fast-loop design being taken into account. In other words, a kind of 
sequential loop-closing approach may be appropriate, the characteristic- 
locus method being used inside the sequential loop-closing approach. 


DESIGN EXAMPLE 155 


4.4 Design example 


(1) THE PLANT 


Consider the aircraft model AIRC described in the Appendix. This has three 
inputs, three outputs and five states. 


(2) THE SPECIFICATION 


We shall attempt to achieve a bandwidth of about 10 rads~! for each loop, 
with little interaction between outputs, good damping of step responses and 
zero steady-state error in the face of step demands or disturbances. We 
assume a one-degree-of-freedom control structure. 


(3) PROPERTIES OF THE PLANT 


The time responses of the plant to unit step signals on inputs 1 and 2 exhibit 
very severe interaction between outputs. The poles of the plant (eigenvalues 
of A) are found to be 


0, —0.78 + 1.03j, —0.0176 + 0.1826) 


so the system is stable (but not asymptotically stable). The number of finite 
zeros of the plant can be at most n —m — d, where n is the McMillan degree (5), 
m is the number of inputs or outputs (3), and d is the ‘rank defect’ of the 
product CB (see Chapter 2). For the present example, 


0 0 
CB=|-012 1 0 (4.63) 
0 0 


which has rank = 1, and therefore d = 2. Thus this plant has no finite zeros, 
and we do not expect any limitations on performance to be imposed by zeros. 

Figure 4.7 shows the three characteristic loci of the plant. Since 
the range of magnitudes is very large over the frequency range (0.001 to 
100 rads~*), these loci have been shown in the usual polar form, but with 
logarithmic calibration of the axes. The crescent-shaped dashed line is the 
M = ,/2 circle, distorted by the logarithmic calibration. The loci have also 
been displayed for both positive and negative frequencies, since this makes it 
easier to count encirclements. It can be seen that the loci encircle the point 
—1+0j (which lies inside the M = ./2 ‘circle’) twice clockwise and once 
anticlockwise, giving one net clockwise encirclement. We conclude that if 


negative feedback were applied around the plant as it stands, the closed loop 
would be unstable. 


156 MULTIVARIABLE DESIGN: NYQUIST-LIKE TECHNIQUES 


6 


Im (transformed) 


Re (transformed) 


Figure 4.7 Characteristic loci of plant, with logarithmic calibration of the axes 
(the broken curve is the M= af 2 ‘circle’). 


(4) ALIGNMENT AT 10 rads“! 


Since we are aiming for a closed-loop bandwidth of 10 rads~', we try using 
the ALIGN algorithm at that frequency in order to reduce interaction there. 
The resulting (constant) pre-compensator is: 


K, = ALIGN(— G(j10)) 


—71.535  —0.0036 —3.669 
=| —8.5375 —9.9984 —0.5376 (4.64) 
— 189.44 0.0065 —69.378 


This gives 
0.983 +0.068j 0.0001 —0.003j —0.0005+0.009j 


G(j10)K,=|  0.068+0.0004j —0.005 +1.010j 0.0008 + 0.0003} 
—0.009+0.100j 0.0000—0.0005j —0.9962 —0.086j 


(4.65) 


which shows that the only significant interactions left at œ = 10 are from 
input 1 to outputs 2 and 3, and that these interactions have been reduced to 
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Figure 4.8 Characteristic loci of GK,,, with logarithmic axis calibrations. 


10% or less (at that frequency). Note that the (2, 2) element is j rather than — 1 
(approximately); the power of the ALIGN algorithm is evident here, since it 
has allowed us to decouple the system at œw = 10, even though there is no 
good real approximation to — G(j10) £. 

In fact, it is more useful to have the second column of G(j10)K,, close 
to [0 —j 0]" rather than to [0 j 0]', since this results in one characteristic 
locus approaching the origin along the negative imaginary axis rather than 
along the positive imaginary axis, and gives no encirclements of — 1 by the set 
of characteristic loci, as we shall show below. We therefore change the sign of 
the second column of K,, to obtain 


—71.535 0.0036 += — 3.669 
K,=| —85375 9.9984 —0.5376 (4.66) 
—189.44  —0.0065 —69.378 


Figure 4.8 shows the characteristic loci of GK,, again with logarithmic 
calibrations. We see that one locus comes up along the imaginary axis from 
—Jjoo (because of the plant pole at 0) and we must therefore be careful about 
counting encirclements. 

If the Nyquist contour is imagined to be indented to the left around 
the pole at 0, so that one open-loop pole is enclosed within the contour, then 
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for closed-loop stability we need one anticlockwise encirclement of — 1 by the 
characteristic loci. We can see that the image of the indentation does indeed 
give one such encirclement, either by the usual conformal-mapping argument 
(since eigenvalues are analytic functions) or by direct calculation: the eigen- 
values of G(—10~4+j0)K, are —10°, 97 and 160 (approximately). This 
shows that one characteristic locus becomes large and negative as the 
indentation crosses the negative real axis, so the image of one locus of G(s) Ky, 
as s goes round the indentation (s = ce °: 4n < 0 < 32, 0<e« 1) is as 
shown by the dashed line in Figure 4.8. If we examined the loci in the vicinity 
of —1 we would see that the other two loci do not encircle it, so the net result 
is one anticlockwise encirclement. If we closed the feedback loops at this stage 
the system would be stable. 


(5) APPROXIMATE COMMUTATIVE COMPENSATOR 


As might be expected from (4.65), however, two of the loci pass extremely 
close to —1 at œ = 10 and require some compensation. Figure 4.9 shows the 
loci on a Nichols display, from which we can see that two of the loci require 
about 40° of phase lead at the 0 dB cross-over frequency if they are to remain 
outside the M = ,/2 circle. We should aim for a cross-over frequency rather 
lower than the required closed-loop bandwidth, and we choose 5 rads~* 
initially. We shall design an approximate commutative compensator, with the 
compensator K, still in place. A design was initially obtained at 5 rad s™ t, but 
some cut and try showed that the compensator was most effective when 


Gain (dB) 
è 


— 180 160 140 120 100 80 -60  —40 —20 0 
Phase (deg) 


Figure 4.9 Characteristic loci of GK,, on Nichols chart. 
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designed at 10 rads~*: details of this design only are given. We use the 
ALIGN algorithm again to obtain the constant matrices A and B which 
appear in equation (4.22). If 


G(j10)K, = WAW! (4.67) 
then 
0.2426 —0.2077 —0.0016 
A = ALIGN(W~') =| —0.0087 0.0079 0.9999 (4.68) 
0.6151 0.9656 0.0010 
and 


2.1937 0.0031 0.5587 
B = ALIGN(W) = | —1.3900 —0.0017 0.6491 (4.69) 
0.0278 1.0000 0.0014 


In order to determine which two of the diagonal elements of A require 
compensation, we have to examine the eigenvalues and eigenvectors of 
G(j10)K, when they are evaluated simultaneously by suitable software, to 
ensure that the ordering of the elements of A corresponds to the ordering of 
the columns of W. In this case it is the (1,1) and (2,2) elements of A that 
require compensation. 

We shall try a standard phase-lead compensator of the form 


a(l +sT) 
Tiar’ os 1 (4.70) 


From Chapter 1 we find that to get a maximum phase lead of 40° we need 
a = 0.2175. The maximum phase lead occurs at the frequency 1/T/a, so to 
obtain this phase lead at 5 rads”! we choose T = 0.429. We therefore insert, 
between A and B, the dynamic matrix 


0.0933s + 0.2175 R r 
0.09335 + 1 
M(s) = 7 0.0933s + 0.2175 (4.71) 
0.0933s + 1 
0 0 1 


In Figure 4.10 the resulting loci are plotted in Nichols form, and the loci are 
seen to have been given the required phase advance. The intersections with 
the M = 1/,/2 circle show that, if they were ordinary Nyquist loci of SISO 
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Gain 
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Figure 4.10 Characteristic loci of GK, Km, on Nichols chart. 


systems, then each of them would give a closed-loop bandwidth of about 
10 rads™ +, as required. 

Note that our medium-frequency compensator K,,(s) = AM(s)B 
satisfies K,,(s) > I as |s| > œ (approximately), but K,,(s) clearly has a signifi- 
cant effect at 10 rad s71, so that the decoupling obtained by K, may well have 
been destroyed. To check this we can evaluate G(j10)K,K,(j10). To evalu- 
ate the interactions at this frequency it is appropriate to examine 


0.5664 0.0027 0.0058 
abs (G(j10)K;,Km(j10)) =| 0.0239 0.9999 0.0014 (4.72) 
0.0579 0.0012 0.6797 


This shows that decoupling has not been destroyed at 10 rads” +. We still 
have an interaction of about 10% from input 1 to output 3, as before. 


(6) LOW-FREQUENCY COMPENSATION 


Figure 4.10 shows that one of the characteristic loci has a low-frequency gain 
of only about 10. Hence we must have o(G(0)K,K,,(0)) < 10, which will 
certainly not achieve the objective of zero steady-state error in the face of step 
disturbances. To attain this it is clearly necessary to introduce integral action. 
We shall apply compensation of the form (1 + sT)/sT to each characteristic 
locus, since this ensures that the integral action does not affect the loci at 
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Figure 4.11 Characteristic loci of GK,,K,,K,, on Nichols chart. 


frequencies much above 1/T. Since we do not want frequencies near 5 rad s~! 
to be affected much, we should choose T > 1/0.5 = 2, say. On the other hand, 
keeping T small extends the frequency range over which integral action is 
effective, and so speeds up the elimination of errors. We are therefore driven 
to choosing T = 2. We could choose a different value of T for each locus, but 
the advantage of having only one value of T is that the commutative 
controller becomes simply 


l+sT 
ee, 


K\(s) = 


(4.73) 


and each locus is shaped by (1+ sT)/sT at each frequency. Note that each 
locus will have an additional phase lag of 45° at œ = 0.5 (compared with 
Figure 4.10); this is acceptable, but will result in a conditionally stable system, 
since the resonance lobe exhibited by one of the loci will now cross the 
negative real axis. 

Figure 4.11 shows the characteristic loci of G(s)K,,K,,(s)K,(s) in 
Nichols form, while Figure 4.12 shows its smallest and largest principal gains 
in Bode form. Figure 4.13 shows the smallest and largest principal gains of the 
closed-loop transfer function, and shows that the bandwidth requirement has 
been achieved. Closed-loop time responses to step demands on each of the 
outputs are shown in Figure 4.14. Interaction is greatly reduced, compared 
with the open-loop plant, and the responses are well damped. 

Of course it is easy to obtain excellent step responses with a linear 
model. At this stage one should check typical control-signal levels, to 
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Figure 4.12 Largest and smallest open-loop principal gains. 
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Figure 4.13 Largest and smallest closed-loop principal gains. 
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Figure 4.14 Closed-loop step responses to step demand on output 1 (solid 
curves), output 2 (dashed curves) and output 3 (dash-dotted curves). 


determine whether the designed system would behave substantially in accord 
with the linear model. If it would not, then the original bandwidth specifica- 
tion was probably too ambitious. 


(7) REALIZATION OF THE COMPENSATOR 


We have designed the final compensator as the series connection of five 
compensators: one for high-frequency alignment (K,), three for shaping the 
characteristic loci near the cross-over frequency (Km) and one for shaping the 
loci at low frequencies by adding integral action (K,). The dynamical com- 
plexity of the compensator is quite low, however, and it can be realized as a 
five-state linear system (two states for the phase-lead transfer functions in K „, 
and three for the integrators in K,). 


The following is a state-space realization of the complete 
compensator: 


0 0 0 0 0 
0 0 0 0 0 

KA= 0 0 0 0 0 (4.74) 
1.0968 0.0016 0.2793 —10.7225 0 


—0.6950 —0.0008 0.3246 0 — 10.7225 
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1 0 0 
0 1 0 
KB = 0 0 1 (4.75) 


2.1937 0.0031 0.5587 
— 1.3900 —0.0017 0.6491 


— 29.3803 0.0167 1.8067 164.6078 — 94.9601 
KC=]| —3.5182 5.0006 —0.2555 20.8859 —11.1922 


— 78.0162 —0.0028 —33.7400 743.9029 232.0489 
(4.76) 


— 58.7607 0.0333 —3.6134 
KD = —7.0365 10.0011 —0.5110 (4.77) 
— 156.0324 — 0.0056 — 67.4800 


4.5 Reversed-frame normalization 


The characteristic-locus design technique is based on the eigenvalue- 
eigenvector decomposition 


G(s) = W(s) A (sW 7 (s) 


of a square transfer function G(s). Let us now focus instead on the singular- 
value decomposition 


G(s) = Y(s)£(s)U "(s) (4.78) 
where 

E(s) = diag {o,(s),..., m(s)} (4.79) 
the c; being the principal gains of G(s), and U(s) and Y(s) being unitary 
matrices. 


The o; are real-valued, being in essence measures of ‘gain’, but we can 
associate them with ‘phase’ information in the following way. First we define 


0,(s) = arg min ¢ [U ¥(s) Y(s) — diag {exp Gy] (4.80) 
Wi 
O(s) = diag {exp(j6;(s))} (4.81) - 


and 


m(G(s)) = E [U *(s) Y(s) — O(s) J (4.82) 
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The point of this is that if m(G(s)) = 0, then U (s) and Y(s) are aligned in the 
sense that, if u,(s) and y,(s) are their respective ith columns (namely the ith 
input and output principal directions, respectively) then 


yi(s) = u;(s) z(s) (4.83) 


where z(s) is some complex number, and if we normalize U (s) and Y(s) so that 
ll y:(s) | = lus) ||, then 


y:(s) = u,(s) exp (j6,(s)) (4.84) 
(Note that the idea here is the same as that of the least-squares version of the 
ALIGN algorithm, namely that defined by (4.43).) We shall say that G(s) is 


aligned if m(G(s)) = 0. 
If we now define 


F(s) = O(s)X(s) (4.85) 
and 

Z(s) = ¥(s)O*(s) (4.86) 
then we can write the decomposition (4.78) as 

G(s) = Z(s)I'(s)U *(s) (4.87) 


This is called the quasi-Nyquist decomposition of G(s), and it should be 
noted that 


I'(s) = diag {71(s), . . . , Y¥m(S)} (4.88) 
where each of the y; is now complex-valued, and so carries ‘phase’ as well as 
‘gain’ information, in some sense. 

The sense in which the y; carry ‘phase’ information is made precise by 


the following result, which implies that the graphs of (Jo) approximately 
locate the characteristic loci. 


Theorem 4.1: Let 4(s) be an eigenvalue of G(s). Then, for some y;,(s), 


|A(s) — y:(s)| < m(g(s)) max |+;(s)| (4.89) 


= m(G(s))o(G(s)) (4.90) 


Proof: See Hung and MacFarlane (1982), Chapter 3.4. 
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The loci of y,(j@) are called quasi-Nyquist loci, and it should be noted 


that 
Injo) = o;(jo) (4.91) 
Now, suppose that the compensator K(s) has the structure 
K(s) = U(s)M(s)Z"(s) (4.92) 
where 
M(s) = diag {1(5), - - - > Hm(S)} (4.93) 
Then 
G(s)K(s) = Z(s)T(s)M(s)Z"(s) (4.94) 
= Z(s)N(s)Z"(s) (4.95) 
= Y(s)N(s) Y"%(s) (4.96) 
where 
N(s) = diag {v1 (5), - - - » Vm(5)} (4.97) 
and 
v,(s) = y;(s) (Ss) (4.98) 


Then the return ratios G(s)K(s) and K(s) G(s) are both normal ([GK]"GK = 
GK[GK ]}, etc.) and aligned (m(GK) = 0). 

In this case we see that the quasi-Nyquist loci are precisely the 
characteristic loci, and that their magnitudes are the principal gains of both 
return ratios. We therefore approach more nearly the classical situation of 
being able to predict stability, performance and robustness from the same set 
of loci. 

A compensator with the structure defined by equation (4.92) is called a 
reversed-frame normalizing (RFN) compensator. Of course, such a compen- 
sator is no more realizable than an exact commutative controller, but, as 
might be expected, one can obtain useful results by settling for an approxi- 
mate RFN controller. 

Hung and MacFarlane (1982) have developed the above theory and its 
applications in great detail, and outlined an optimization-based technique for 
computing approximate RFN controllers. This technique assumes that one 
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knows what a satisfactory return ratio would be, in the form 
Q(s) = Y(s)N(s) Y"(s) (4.99) 


Here Y(s) comes from the singular-value decomposition of the plant, so that 
only the elements of the diagonal matrix N(s), namely the required quasi- 
Nyquist loci, need to be specified. Once Q(s) has been specified, the structure 
of K (s) must be specified in some way (for example, the order of each element) 
and numerical optimization used to find 


K(s) = arg min || Q(s)— G(s)K(s)|| (4.100) 
K(s)ex 


where x denotes the set of compensators over which the search is performed. 

Hung and MacFarlane performed least-squares optimization, and 
specified x as the set of transfer functions having a matrix-fraction description 
with a particular canonical form (Hermite form) and McMillan degree. 
Clearly many alternatives to these particular choices are possible; for 
example, one could constrain certain elements in K(s) to be zero. 

One could also attempt approximate RFN at one specific frequency, 
and hope that the resulting compensator would be beneficial over a useful 
range of frequencies, just as we did with the approximate commutative 
compensator. In particular, if G(s) has no poles at s = 0, then G(0) is real, and 
therefore U (0), Y(0) and £(0) are all real. In this case an exact (ats = 0)RFN 
compensator can be realized as 


K(s) = UO)diag {41 (9), ..., pm (8)} YO) (4.101) 


(since Y(0) = Z(0)). This can be particularly useful for manipulating the 
principal gains at low frequencies. 


EXAMPLE 4.3 


A partly compensated return ratio is closely approximated by the 
transfer function 


16.9 36.12 
s+5  s+11 
s)= 4.102 
2s) —9.57 —4.17 ( ) 
s+5 s+li 


It is desired to add integral action by means of a series compensator which 
will equalize the two principal gains at low frequencies, and give a loop 
gain of 50 at œ = 0.01. 
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We find that the singular-value decomposition at s = 0 is 


Q(0) = Y()ZO)U*O) 


0.940 0.342][5 O][ 0.766 0.643 
=| ose | l a a (4.103) 


A compensator of the form 


stk 0 
s 
K(s) = U (0) Y#(0) (4.104) 
0 s+5k 


S 


would have the desired effect since, at low frequencies, we would have 


k 
— 0 
0 
Q(jw)K (jo) — rols i jo q |70 (4.105) 
0 
jæ 


so that each of the principal gains would be 5k/w at low frequencies. To 
achieve a gain of 50 at œ = 0.01 we choose k = 0.1. 

Note that this compensator does not leave the return ratio un- 
altered at high frequencies; however, since both principal gains of K(s) 
become unity at high frequencies (w > 5, say), the principal gains of Q(s) 
and Q(s)K(s) are the same at high frequencies. 


4.6 Nyquist-array methods 


4.6.1 Compensator structure 


We shall again assume that the plant’s transfer function is square. Suppose 
that a compensator is rational, invertible, and has all its poles and (trans- 
mission) zeros in the left half-plane (including the origin); one normally uses a 
compensator of this form if possible. The following theorem shows that the 
compensator can always be obtained as the series connection of systems of 
three simple types. 


Theorem 4.2 (Rosenbrock, 1970): Let K(s) be square, rational and 
invertible, and have all its poles and zeros in the open left half-plane. 
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Then 


K(s) = K,K,(s)K,(s) (4.106) 
and 
K(s) = K{(s)Kj(s)K; (4.107) 


where K, and K; are permutation matrices (namely ones which 
permute rows or columns, or, in other words, reorder the outputs 
or inputs), K,(s), Kj(s) are products of elementary matrices, each 
elementary matrix having the form 


Ki(s) =I+ak(s)eel, j#i (4.108) 


where af(s) is rational and stable, e; and e; are the ith and jth 
standard basis vectors, and K,(s) and K,(s) are diagonal matrices, 


K,(s) = diag {k,(s)} (4.109) 


each k,(s) being rational, non-zero, and with poles and zeros in the 
open left half-plane only. 


Proof: Omitted. It is similar to the proof of the existence of the 
Smith-McMillan form, given in Chapter 2. See Rosenbrock (1970) for 
details. 


This theorem implies that compensator design can be split into two 
stages. In the first stage, K, and K,(s) are used to make the return ratio 
diagonally dominant, if possible (but the theorem does not guarantee that 
this is possible, of course). If dominance is achieved then the system has been 
decoupled to some extent, and behaves more like a set of independent SISO 
systems. The second stage begins when dominance has been achieved, and 
consists of designing a set of separate SISO compensators (the elements of 
K.(s), which is diagonal), one for each loop. At this stage no attention is paid 
to the remaining interactions in the system, except in so far as the 
Gershgorin bands of the return ratio replace SISO Nyquist loci. Closed-loop 
stability can be ensured by shaping the Gershgorin bands, using SISO 
techniques, in such a way that they do not overlap the point —1 + j0, and 
encircle it the appropriate number of times, in accordance with the gen- 
eralized Nyquist theorem. 

Since it is the point —1+j0 rather than the origin that must be left 
uncovered by the Gershgorin bands of the return ratio Qo(s), it is, strictly 
speaking, I + Qo(s) which must be diagonally dominant rather than Qo(s) 
itself. In practice, however, one usually aims to make Q,(s) diagonally 
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dominant, for several reasons. First, if a high degree of dominance of Qo (s) is 
achieved over a considerable range of frequencies (namely, if the Gershgorin 
bands of Q,(s) are made very narrow), then there is a good chance that 
I + Qo(s) will be diagonally dominant. Secondly, subsequent compensation 
by diagonal transfer functions can leave the diagonal dominance of Q,(s) 
unaltered, as explained below, whereas the dominance of I+ Qo(s) will 
always be changed. And thirdly, if Q o(s) is made very dominant, but I + Qo(s) 
is not, so that the Gershgorin bands of Q(s) are narrow but some of them 
cover the point —1+j0, then subsequent compensation by a diagonal 
transfer function can be used to swing the Gershgorin bands away from 
—1 +j0, with little effect on their width. 

Design can be pursued using either direct Nyquist arrays G(s), K,, 
K,(s) and K,(s), or inverse Nyquist arrays G ~ 1(5), K, 1, Kp 1(s) and Ko *(s). 
In either case stability can be determined from the return ratio evaluated at 
any point in the loop, provided the Gershgorin bands at that point do not 
overlap the point — 1 +j0. But to monitor performance, as well as stability, a 
particular return ratio must be examined, most commonly that at the output 
of the plant, 


Qo(s) = G(s)K,K,(s)K-(s) (4.110) 


Since K,(s) post-multiplies the other transfer functions in equation (4.110), 

- and is diagonal, the effect of each of its elements is to multiply one column of 
G(s)K,K,(s) by the same transfer function. Hence column dominance of 
G(s)K,K,(s) is not destroyed by K,(s), whereas row dominance may be. 
When working with direct Nyquist arrays it is therefore usual to attempt to 
achieve column dominance. When working with inverse arrays, on the other 
hand, we have 


O51(s) = Ko '(9)Ky (9) Ks G S) (4.111) 


Since KZ '(s) pre-multiplies the other transfer functions, the effect of each of 
its elements is to multiply one row of Kẹ ‘(s)K, +G- 1(s) by the same transfer 
function, so leaving row dominance undisturbed. Therefore, when working 
with inverse Nyquist arrays it is usual to try to achieve row dominance. 


4.6.2 The inverse Nyquist-array (INA) method 


The motivation for using inverse rather than direct arrays is that, in principle 
at least, they allow the closed-loop behaviour to be determined more pre- 
cisely from open-loop information. Consider K,(s), the diagonal part of the 
compensator, to be made up of the series connection of K,(s), another 
diagonal dynamic compensator, and F, a diagonal constant compensator, so 
that 


K,(s) = Ka(s)F (4.112) 
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H(s) 


Figure 4.15 Representation of a feedback system which allows the effect of 
varying feedback gains to be studied. 


This allows us to study the effects of changing the gain in each feedback loop 
while keeping the dynamic characteristics fixed. If we also define 


Q(s) = G(s)K,(s)K,(s)Ka(s) (4.113) 


then we have 


Qo(s) = Q(s)F (4.114) 


and the closed-loop transfer function becomes 


T(s) = H(s)F (4.115) 


where 
H(s) = [I + Q(s)F]~'Q(s) (4.116) 


(See Figure 4.15 for a block-diagram representation of this.) 

Rosenbrock (1970) introduced the notation H (s) to denote the inverse 
H ~'(s). This is useful because it allows us to refer easily to elements of H~! (s) 
as h;;(s), the point being that h;; # (h,;)'. In this notation we have 


H(s) = O(s)+F (4.117) 


and it is the simplicity of this relationship, as compared with (4.116), which 
underlies the use of inverse Nyquist arrays. 


IfF = diag{f,,f,,... »Jm} then setting f; = 0 represents opening the 
ith feedback loop. In this case we have 


172 MULTIVARIABLE DESIGN: NYQUIST-LIKE TECHNIQUES 


Guth Gi2 tae tee dim 
Gat Gar + f2 


ĝia- t fi-a 
ĝi, 
Gissjita Siri 


Å (s)| =0 = 


dmı y l poma Ômm + fm 
(4.118) 
Now suppose that z is the vector of signals at the input to H(s) (see 


Figure 4.15). The transfer function between z; and y; (where y is the output of 
the closed-loop system), with the ith loop still open, is 


L(s) = LÉ (S) p -0) Tis (4.119) 


_ det LH (9 Ij,=0] 
det LH (s)| s =0] 


(4.120) 


where H (s)|'#,=0 is just H(s)| sı=0 With the ith row and ith column removed. 
Now, if each f; > 00 (j # i) then, expanding by the ith row, we obtain 


det [Ê (s)|, -0] > du(s)detLH (917, =o] (4.121) 
and hence 

1,(s) > ee | (4.122) 

VAQ) ` 


This result has the following interpretation. If the gains in all the loops 
except the ith are very high, and the ith loop is open, then 4;;(s) (which 
contains open-loop information only) gives a good indication of the inverse 
response between z; and y;. The ith loop dynamics (namely the (i, i) element of 
K,(s)) can therefore be designed independently of the other loops, and 
inspection of 4,;(j@) for the final design will give a good indication of the 
behaviour of that loop (provided one is adept at interpreting inverse Nyquist 
loci- more on that shortly). 

In practice the loop gains cannot be made arbitrarily high, and are 
frequently constrained to be rather low, so it is important to have some 
measure of the error incurred by using g,(j@) as an approximation to 
1/1,(jo). This measure is provided by drawing Ostrowski bands. These are 
based on: 
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Theorem 4.3 (Ostrowski’s Theorem): Let AeC™*™" be (a constant 
matrix which is) diagonally row dominant. Then 


<| 5 al [max (5 lanl | anl) (4.123) 


and a corresponding inequality holds if A is diagonally column 
dominant. 


Proof: Omitted. See Rosenbrock (1970). 


If A( jæ) is row dominant, then we can apply this theorem to obtain 


Fuso - ggg < rogo) (4124) 
where 

pio) = F, inl jo) (4.125) 
and 

p(o) = max( 5 [hye jo)| | iy) (4.126) 
Now, recalling that 

haljo) = åja) + f; (4.127) 


and using the relationship 


EE E (4.128) 
h;(jæ) I;(jo) ` 


(see Exercise 4.3), we can rewrite (4.124) as 


jaio- 71 i,(jo) o) |< < p(w) d;(w) (4.129) 


Finally, since h, (JO) = ĝa (jæ) (if i # k), we recognize p;(@) as the radius of 
the ith Gershgorin circle of 01 jm) and, since H (jæ) is assumed to be row 
dominant, we have ¢,(w) < 1. 

We see, therefore, that 1/I;(jw) is contained within a circle whose 
centre coincides with that of the ith Gershgorin circle of Q(ja), and whose 
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Figure 4.16 An Ostrowski band. 


radius is smaller than that of the ith Gershgorin circle by the factor ġ;(œ). We 
call this circle an Ostrowski circle, and the union of such circles an Ostrowski 
band. Clearly, the ith Ostrowski band is contained within the ith Gershgorin 
band, if H (ja) is diagonally dominant. Diagonal dominance of H (jæ) corre- 
sponds to the ith Gershgorin band of Q (jœ) not covering the point — f; + j0. 
The situation is summarized in Figure 4.16. If H (jœ) is column dominant, 
then we obtain similar results by interchanging the roles of rows and 
columns. 

Once diagonal dominance has been obtained, INA design proceeds by 
attempting to obtain satisfactory Ostrowski bands, treating them as if they 
were inverse Nyquist loci of SISO systems. The interpretation of inverse 
Nyquist loci (sometimes called Whiteley loci) is less familiar than that of 
direct loci, but is in fact slightly simpler, and we shall summarize this 
interpretation for SISO systems. Figure 4.17 shows a typical inverse locus 
1/I(j@), when I(s) is the forward-path (SISO) transfer function, as shown in 
Figure 4.18, except for a constant gain f, which is considered to be separated 
out from I(s). The first noticeable feature is that the inverse locus has a small 
magnitude at low frequencies, and its magnitude increases with frequency. In 
order to obtain closed-loop stability the locus must satisfy the inverse 
Nyquist stability criterion, which is just the generalized inverse Nyquist 
criterion, but with the set of inverse characteristic loci replaced by the single 
locus. Since we have factored out the gain f from the return ratio, the locus 
of 1/1(j@) must encircle the point — f+ j0 anticlockwise the same number of 
times as I(s) has right half-plane zeros (which is the same as the locus of 
1/(fI(j@)) encircling the point — 1+ j0 that number of times), and typically 
the inverse locus will pass ‘above’ —f+j0, as shown in Figure 4.17, if the 
closed loop is stable. 
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Figure 4.17 Performance prediction from an inverse Nyquist locus. 


Applying phase advance to the return ratio ‘bends’ the inverse locus in 
a clockwise sense. If we label the point 1//(0) as A, the point at which the locus 
crosses the negative real axis as B, the point — f + j0 as C, and the point on 
the locus nearest C as D, then SISO performance and robustness measures — 
are obtained as follows. The gain margin is |OB/OC| (where |OC] is the 
distance from the origin to C, etc.), and the phase margin is the angle ¢ 
subtended by the circular arc, centred on the origin, drawn from C to the 
locus. The steady-state error (as a fraction of a constant reference signal) is 
|OA/AC|, and the peak dynamic magnification M, is f/|CD|. The closed- 
loop (— 3 dB) bandwidth is given by the frequency at point E on the locus, 
which is the point at which the locus intersects a circle, centred on C, of radius 
f,/2. It is easy to show that these relationships hold, and this is left as an 
exercise for the reader: (Exercise 4.3). 

The overall design strategy is now clear. First attempt to obtain 
row diagonal dominance of K,(s)K,G(s), using appropriate compensators 
K, and K,(s). (The next section explains how such compensators may be 
sought.) Then design a SISO compensator k,(s)/f; for each of the loops, 
using SISO techniques, until all the Ostrowski bands are satisfactory (when 
interpreted as SISO inverse Nyquist loci). Assemble the compensator 
K.(s)= diag {k, (s)/f;:i=1,...,m}. As with any design technique, this stage 


Figure 4.18 Factoring out a gain f from the return ratio. 
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must be followed by extensive analysis of the proposed design, using both the 
frequency-domain techniques presented in the previous chapter and time- 
domain simulation. 

The advantage of the inverse Nyquist array method over the direct 
method is that the use of Ostrowski bands allows the behaviour of individual 
loops to be predicted, taking into account the effects of interactions with 
other loops. No such relationship is available for the direct method.” A 
significant disadvantage, on the other hand, is that the final inverse com- 
pensator, K,(s)K,(s)K,, must correspond to a realizable compensator 
K,K,(s)K,(s), since it is K,K,(s)K,(s) that is to be implemented. (If it is 
invertible, the most satisfactory way of performing the inversion is to obtain a 
state-space realization of K,(s)K,(s)K,, and then use the algorithm given in 
Chapter 8—unless the complete compensator is diagonal, in which case the 
individual elements can be inverted.) 

Ostrowski bands have one further use, which can be exploited even if 
the INA technique is not used to design a compensator. Since each band 
encloses the inverse Nyquist locus corresponding to one loop, when all the 
other loops are closed with their feedback gains at the design values, it can be 
used to indicate how the stability of the complete system changes when the 
gain in that loop changes. In particular, it can be used to indicate whether the 
system would be stable if one loop failed, while all other loops remained at 
their design gains. This indication is less conservative than the one obtained 
by using Gershgorin bands, described in Chapter 3. On the other hand, the 
Gershgorin bands can be used to predict stability when the gains in all the 
loops change simultaneously, whereas the Ostrowski bands cannot. 


4.6.3 The direct Nyquist-array (DNA) method 


Here one works with G(jæœ) rather than G( jm), and designs K,, K,(s) and 
K,(s) directly. This has several advantages. First, the designed compensator 
does not need to be inverted, and one consequence of this is that any structure 
imposed by the designer, such as setting certain elements to zero, is retained. 
It also gives the designer more freedom in the choice of compensator, since its 
inverse does not need to be realizable. Secondly, the plant need not be square, 
since it need not have an inverse. Thirdly, it may be that the inverse 
compensator designed by the INA method has right half-plane zeros, particu- 
larly if it is designed semi-automatically by one of the methods described 
in the next section. In this case the compensator itself will be open-loop 
unstable, which is usually very undesirable. With the DNA method one can 
keep control over the location of the compensator poles more easily. 
However, there is no tool, such as the Ostrowski bands, which allows 
the designer to take into account the effects of interactions between loops. 
(Ostrowski bands can still be drawn for direct Nyquist arrays, of course, but 
their meaning is lost.) Suppose that diagonal dominance of a direct return 
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ratio GK,,K,(s) has been achieved, and that the diagonal elements of its 
Nyquist array, together with their Gershgorin bands, are examined in order 
to choose SISO compensators for the individual loops (that is, to choose the 
elements of the diagonal compensator K,(s)). Closed-loop stability will be 
predicted correctly by using the Gershgorin bands. But if the performance of 
a loop is predicted by treating the locus of its diagonal element as if it were the 
locus of a SISO system, then the prediction will be correct only if that loop is 
closed, while all the other loops remain open. As soon as the other loops are 
closed, their behaviour will affect the behaviour of the loop being designed. 

This will not matter if, by this stage of the design, interaction between 
the loops has already been almost eliminated, so that the Gershgorin bands 
are very narrow, and the system behaves almost like a number of separate 
SISO systems. Therefore when using the DNA method it is important not to 
be content with merely achieving diagonal dominance, but to try to obtain as 
large a degree of dominance as possible before attempting to compensate the 
individual loops. 

This requirement can be relaxed somewhat. We know that closed- 
loop behaviour will be satisfactory (low interaction, low sensitivity) whenever 
all the loop gains are large, whatever the details of the design. So we need not 
strive to maximize diagonal dominance at those (low) frequencies at which we 
expect the loop gains to be large, namely frequencies significantly lower than 
the 0 dB cross-over frequency of any loop. 

If a high degree of diagonal dominance cannot be obtained, it is 
possible to go ahead and design a compensator for each loop so as to obtain 
apparently promising behaviour of the Gershgorin bands. But this leads to a 
very ad hoc design procedure which relies on luck. If the resulting closed-loop 
behaviour is unsatisfactory, it is not clear what can be done to improve it, at 
least within the confines of the Nyquist-array method. It may be possible to 
proceed in a systematic way, however, by switching to some other design 
method at this point. For example, if the behaviour of only one loop 
remained unsatisfactory, one could examine the open-loop characteristics of 
that loop with all the other loops closed, and compensate it by using SISO 
methods —in effect, switching to sequential loop closing at this point. 


4.7 Achieving diagonal dominance 


Nyquist-array methods are effective, provided one can obtain the required 
degree of diagonal dominance fairly easily. In many cases, however, this is 
very difficult to achieve. Manual cut-and-try methods, which were the ones 
originally suggested, fail to achieve diagonal dominance (with reasonable 
effort) more often than not, and this is probably the main reason why the 
methods have not been employed very widely. In recent years, however, some 
successful automatic ways of achieving dominance have been proposed, and 
these promise to give a new lease of life to the Nyquist-array approach. 
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It should always be remembered, though, that for some plants it 
would be entirely unreasonable to expect to obtain diagonal dominance, or a 
high degree of dominance, while using control signals of reasonable ampli- 
tude. In a highly manoeuvrable ship, for example, one would not expect to be 
able to decouple the ship’s rate of turning from its roll angle to any 
appreciable degree. And one should bear in mind that diagonal dominance is 
not a necessary precondition for feedback design, as the other techniques in 
this and later chapters show. Thus persistent failure to achieve an adequate 
degree of diagonal dominance probably indicates that it is time to try a 
different approach. 


4.7.1 Cut and try 


It is sometimes possible to examine the display of a Nyquist array and 
observe that some straightforward transformation will achieve diagonal 
dominance. A simple but contrived example will show what is involved. 
Suppose the plant transfer function is 


—s st+l 
3s+2 3s+2 
Gael" à (4.130) 
s+2 -s 
3s+2 3s+2 


so that the inverse transfer function is 


G(s) = Ee n ‘| (4.131) 


This is clearly neither row dominant nor column dominant anywhere on the 
Nyquist contour. If we wish to use the INA method we look for operations 
on the rows of G(s) which could produce dominance. (Row operations 
correspond to pre-multiplications of G(s), which correspond to post- 
multiplications of G(s), and hence to pre-compensation of the plant.) Since 
1912(s)| > 1922(s)| and [ĝ21(9)| > 1911(s)| for any s, row dominance (and 
column dominance, in this case) can clearly be achieved by interchanging the 
two rows. Mathematically, this corresponds to pre-multiplication by 


> fo1 
geli al (4.132) 


and hence to post-multiplication by the pre-compensator transfer function 


0 1 
Be A (4.133) 
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so that the compensated plant is represented by 


s+1 —s 
3s+2 3s+2 
G(s)K, = (4.134) 
—S s+2 
3s+2 3s+2 


Physically, this corresponds to nothing more than a re-ordering of the inputs 
(or a re-assignment of inputs to outputs, if we take the view that the principal 
role of each input is to control one output). 

In this artificial example both the direct and the inverse array have 
been made equally dominant, and the compensation required in each case is 
the same. This is not usually true. It is quite possible for a particular 
compensator to make the direct array dominant, but not the inverse array, 
and vice versa. 

The ‘elementary matrices’ which figure in Theorem 4.2 are supposed 
to represent simple transformations devised by the designer. Thus, for 
example, the transformation ‘subtract 2j@ times the first column from the 
second column’ results in the pre-compensator 


K,(s) = F T (4.135) 


(for a two-input, two-output system). If column dominance is being sought 
then each column can be scaled by any scalar function without affecting the 
degree of dominance, so this compensator can be made realizable by dividing 
the second column by s: 


1 -2 
Kla i A (4.136) 


In practice, it is rarely possible to make much progress by relying on 
being able to find such transformations by ad hoc means. 

An alternative strategy is to try to diagonalize (i.e. completely de- 
couple) a system at one frequency, and hope that the effect will be sufficiently 
beneficial over a wide range of frequencies to result in diagonal dominance 
being attained. The easiest way to do this is to invert the system at one 
frequency. If the system has no poles at the origin, then K = G~'(0) is a 
realizable (because constant) compensator. At any other frequency G~!(ja) 
is complex, but can be approximated (with luck) by a real constant compen- 
sator. Fortunately, we already have an algorithm for performing the required 
approximation: the ALIGN algorithm introduced in Section 4.3.1 above. 

If we extend this idea to diagonalizing the system at several fre- 
quencies, and also to using dynamic compensators to do so, then we enter the 
realm of ‘pseudo-diagonalization’, to be described in Section 4.7.3. 
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EXAMPLE 4.4 


A plant has the inverse transfer function 


2 
s*+s+2 
—— 10s+3 
x s+1 
G(s) = is (4.137) 
s‘+ts+l 
5s+1 — 
s+2 


It is not diagonally dominant at low frequencies, since 


z 23 
G(0) = i 
(0) l 1°0 | (4.138) 
The compensator 
> —0.25 1.5 
K, = G(0) = A 
»= GO) | ee | (4.139) 


gives K,G(0) = J, which is obviously diagonally dominant, and it gives 
column, but not row, dominance at high frequencies, since 


er 


K,G(s)> s as|s|— œ (4.140) 


4.7.2 Perron-Frobenius theory 


As we saw in Section 2.10, Perron-Frobenius theory allows us to check 
whether a plant can be made diagonally dominant by input and output 
scaling. This can clearly be applied to design, but with a proviso: in general, 
we must be careful about using output scaling as part of the strategy for 
achieving diagonal dominance. On the face of it, output scaling corresponds 
to inserting a post-compensator (that is, inserting a compensator between the 
outputs and the variables being controlled). This is physically impossible, 
since the meaningful plant outputs (which are variables such as velocity, or 
thickness of steel strip) cannot be affected by mathematical operations. It is 
certainly possible to change the scaling of the measurements of the output 
variables, however, and Figure 4.19 shows how this is achieved by placing the 
post-compensator in the feedback path. 

One can use the scalings S~!, at the plant input, and S, in the feedback 
path, to obtain a diagonally dominant return ratio at the point labelled X in 
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Figure 4.19 Scaling a plant’s inputs and outputs. 


Figure 4.19. But we must be wary of falling into the trap of believing that this 
return ratio tells us anything about interaction at the plant output. The 
output variables may be interacting with each other to a considerable extent, 
and this interaction may be being hidden by the measurement scaling S. Thus 
the choice of measurement scaling is really part of the specification of the 
feedback design problem, and changing this scaling is not usually admissible 
as a step in a design technique. The output scaling is, in effect, built into the 
definition of the plant. If the specification is rather loose, however (for 
example, that closed-loop stability and good long-term following of set points 
are to be achieved, with no specification on interaction), then output scaling 
may be considered. 

Fortunately, the Perron—Frobenius theory gives useful results, even if 
only pre-compensation (input scaling) is allowed. 

The basic result of Perron-Frobenius theory has already been stated 
in Section 2.10: that a positive, primitive square matrix has a particular 
distinguished eigenvalue 4p (the Perron—Frobenius eigenvalue), which is real, 
positive and greater than the magnitude of any other eigenvalue; furthermore, 
the elements of the corresponding eigenvector may all be chosen to be real 
and positive. 

The next result we need is the following: 


Theorem 4.4 (Seneta, 1973): Let M be a positive, primitive square 
matrix with Perron-Frobenius eigenvalue Ap. Then, for any vector x 
with real positive elements 


sEm x; < p < max D myx A (4.141) 


i t 


and both TET become equalities if x is chosen to be the 
Perron-Frobenius right eigenvector of M (that is, if Mx = ApX). 


To exploit this, we define the normalized comparison matrix 


M ís) = abs(G gaz (Ss) G(s) (4.142) 
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where Gaiag(S) is the matrix containing only the principal diagonal of G(s). 
(We have already seen a similar matrix in Theorem 2.12.) Now, suppose that 
we have a diagonal pre-compensator K(s) = diag{k,(s),..., k,,(s)}, such 
that G(j@)K(j@) is row dominant. Then 


2, Igu (jok; (jo) < |gulj)k,(je)| (4.143) 


Since the (i, j) element of M (s) is 


= | gij(s)| 
m,,(s) = mel (4.144) 


the inequality (4.143) is equivalent to 


dm ii o) fam < <1 (4.145) 


or, since m,;(s) = 1, to 


kj(jo) _ 
kj o) ~ 


(4.146) 


2 myl 


Furthermore, the diagonal pre-compensator which gives the greatest degree 
of row dominance is the one which gives the smallest value of the sum in 
(4.146). 

Now, Theorem 4.4 tells us immediately that the smallest value which 
can be obtained is åp (jæ), and that the best choice for each (diagonal) element 
of K (jæ) is the corresponding element of the Perron—Frobenius right eigen- 
vector of M (jæ). Note that if Ap(jw@) > 2, then there is no possibility of 
obtaining dominance at frequency œ by using a diagonal pre-compensator. 
Also, the optimal choice of the elements of K(ja) gives the same degree of 
dominance in each row of G(jw)K(jo). 

The exact realization of the compensator K(jw) is obviously a 
problem. It can be overcome quite easily, however, by taking one of two 
approaches. The approach advocated by Mees (1981) is to define the constant 
matrix T with elements 


t; = max {m,;(jo)} (4.147) 


and apply the previous reasoning to T instead of M(ja). If the Perron- 
Frobenius eigenvalue of T is less than 2, then choose the elements of the 
constant diagonal compensator K to be the elements of the Perron— 
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Frobenius right eigenvector of T. This will make TK row dominant: 


», tk; < k; (4.148) 
(since t; == 1). But m; (jæ) < t,;, so (4.145) will certainly hold, and hence 
G(jæ)K will be row dominant. 

This approach has the advantage of yielding a constant, real pre- 
compensator. The disadvantage is that the Perron-Frobenius eigenvalue of T 
is usually greater than that of M (jœ) at any frequency œw, so it may well be 
impossible to make TK dominant, while a dynamic diagonal compensator 
K(s) may exist which makes G(jw)K (jæ) dominant. Munro (1985, 1987) has 
pointed out that K (jæ) need not be real, provided the magnitudes of its 
elements follow the frequency variations of the elements of the Perron— 
Frobenius eigenvector of M (jœ). He has therefore suggested that a diagonal 
K (s) should be chosen whose elements behave approximately like those of the 
eigenvector. Since the eigenvector is unique up to scaling only, one diagonal 
element of K (s) can be constant. In practice, it is often effective to display the 
variation of each element of the eigenvector (suitably normalized, usually by 
fixing one element) on a Bode plot, and find a simple transfer function whose 
gain matches this variation approximately. A further advantage of using a 
dynamic compensator is that the optimal degree of dominance can be 
approached at each frequency. N 7 

When using the INA method, dominance of K (ja) G(jo) is sought. In 
this case the matrix 


M(s) = abs(G(s) Ggic,(s)) (4.149) 


is used in place of (4.142), its Perron—Frobenius left eigenvector is used to 
construct K (jæ), and column dominance is obtained. Everything else remains 
the same, except that if a dynamic compensator is used then the elements of 
K(s) must be chosen to have realizable inverses. 

A drawback of using diagonal compensators to achieve diagonal 
dominance is that only row dominance can be obtained when using the DNA 
method, and only column dominance can be obtained when using the INA 
method. This is exactly the opposite of what we would like, since further 
diagonal compensation, for the purpose of ‘loop shaping’, may destroy the 
dominance which has been achieved. It will not destroy it, however, if the 
additional compensation is made the same in each loop. 

If both pre- and post-compensation is used, as shown in Figure 4.19 
(when this is permissible), and the pre-compensator S~! is chosen as de- 
scribed above, then SGS ~' will be both row and column dominant (provided 
Ap < 2). (This is the content of Theorem 2.12; the proof is left as an exercise 
for the reader.) Furthermore, the degree of row and column dominance will 
be the same. In this case further diagonal pre-compensation will leave the 
column dominance unaltered. 
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EXAMPLE 4.5 (Munro, 1985) 


Consider the transfer function 


s+4 1 
s+i)(s+5 5s+1 
G(s) = ( M ) (4.150) 
s+1 2 


s?+10s+100 2s+1 


The inverse Nyquist array G( jæ) is not column dominant. The Perron- 
Frobenius eigenvalue of abs(G(ja) Ggize(J@)) is shown on a Bode plot in 
Figure 4.20. Its value is smaller than 6 dB (i.e. Ap < 2) at all frequencies, so 
it is possible to obtain column dominance by using a diagonal compen- 
sator. Figure 4.21 shows the second element of the Perron—Frobenius left 
eigenvector, when the first element is fixed at 1. It also shows, super- 
imposed, the gain variation of the compensator 


fig = OATS 3.574 


2(s) = 40566 (4.151) 


This matches the variation of the eigenvector very well, and the in- 
verse compensator K(s) = diag{1,k,(s)} gives the Nyquist array for 


Gain (dB) 


1072 107! 10° 10! 10? 
Frequency (rad s~ ') 


Figure 4.20 Perron—Frobenius eigenvalue of the inverse of system (4.150). 
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Gain (dB) 


10°? 107! 10° 10 10? 


Frequency (rad s~ ') 


Figure 4.21 The second element of the Perron—Frobenius left eigenvector of 
the inverse of system (4.150), with the first element fixed at 1 (solid curve), and 
the gain of the SISO compensator (4.151) (dotted curve), 


20 


10 
Re 


Figure 4.22 Nyquist array of KG. 
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K (jw) G(jo) shown in Figure 4.22. The Gershgorin circles are computed 
by columns, and the array is seen to be column dominant. 


As discussed in Section 2.10, if abs(G(jq@)) is not primitive then G(ja@) 
can usually be decomposed into subsystems, and the Perron—Frobenius 
theory should then be applied independently to those blocks of G (jæ) which 
are on its principal diagonal. 

Kantor and Andres (1979) have obtained results which are identical to 
those obtained by Mees (1981), but using the theory of M-matrices (Fiedler 
and Ptak, 1966; Siljak, 1978) instead of Perron—Frobenius theory. They 
examine the most positive eigenvalue of I — M, rather than M. Dominance 
can be obtained if this eigenvalue is smaller than 1, and again a suitable 
compensator is obtained from the corresponding eigenvector. 


4.7.3 Pseudo-diagonalization 


It may not be possible to achieve diagonal dominance by using a diagonal 
compensator. And it is certainly not possible to change the column domi- 
nance of a direct Nyquist array, or the row dominance of an inverse array, by 
using diagonal pre-compensators. So we need to look for ways of auto- 
matically generating compensators with a more general structure. This can be 
done by choosing some measure of diagonal dominance, some compensator 
structure, and then optimizing the measure of dominance over this structure. 

We shall use the term pseudo-diagonalization for any such scheme, 
although the term is often reserved for the particular scheme proposed by 
Hawkins (1972). Hawkins assumed that inverse arrays are to be used, but his 
method can be applied equally well to direct arrays. If we have a plant G(s) 
and a constant compensator K, with Q(s) = G(s)K, then individual elements 
of Q are given by 


q;ljo) = gi (Jo)k; (4.152) 


where g7 (s) denotes the ith row of G(s), and k; denotes the jth column of K. 
Hawkins proposed to minimize 


J= >, pd, lasje} (4.153) 
k=1 itj 
N 
= bs Pe} È lgi (jox) KP (4.154) 
c= i+j 


subject to the constraint 


LAR (4.155) 
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Here {p,} is a set of weighting coefficients, and (4.153) is formulated so as to 
achieve column dominance. By using Lagrange multiplier methods, this can 
be shown to lead to an eigenvalue problem, so the solution can be obtained 
easily. But a shortcoming of the method is that minimizing J; may not 
prevent |q; (j@,)| from being made small, as well as the off-diagonal elements, 
so that diagonal dominance may not be obtained. 

Suppose, however, that we replace the criterion (4.153) by 


È nd X lautionr| 


Pr 
J= 7 
y Pilay Go) 
k=1 


J 


(4.156) 


This is a better measure of column dominance, and has the further advantage 
that it can be minimized without any constraints on K. Equation (4.156) can 
be rewritten as 


Zad > lg? (jor) KP} 
J;= k ate) (4.157) 
2 PrlgF (jo) k;l? 


which is just the reciprocal of the cost function which is maximized by the 
multi-frequency ALIGN algorithm; this has already been described in 
Section 4.3.1. Note that, since (4.157) is to be minimized, while (4.45) is to be 
maximized, the multi-frequency ALIGN algorithm can be used without 
modification to solve the pseudo-diagonalization problem. 


Ford and Daly (1979) have extended this approach to dynamic 
compensators: 


K (s) = [k,(s), .. . ,Km(S)] (4.158) 
in which each column has the form* 

k;(s) = koj+ ... +kgjs? (4.159) 
and each k;j is a column vector. In this case (4.152) is replaced by 

q;(jo) = y; (jo) n; (4.160) 
where y7 (jæ) is the row vector 


vi (jo) = Loi (jo), jog? (jo), .. . (jo) g7 (j@)] (4.161) 
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and y; is the column vector 
n= [kj ae Keil (4.162) 
To maximize column dominance we now minimize the function 


Zad 2 lv? Gor) nf 


» Pal; (jor) n; 


jee 


J 


(4.163) 


and the solution is again given by the multi-frequency ALIGN algorithm. 

Ford and Daly (1979) also replace the sum E, p, {...} by the inte- 
gration fo p(w) {. - . }dæ. This makes no difference to the development given 
above, although it dead, affects the way in which the matrices X and Ť (in 
(4.47) and (4.48)) are formed. In either case the weight p, or p(w) is used to 
emphasize frequencies where dominance must be increased. 

If column dominance of G(s)K(s) is achieved, it is not destroyed by 
any scaling of the columns of K(s). A realizable compensator is therefore ` 
obtained by dividing k,(s) by any polynomial of degree $ (or greater). This 
polynomial should of course be chosen to have stable roots, the locations of 
which can be chosen to provide desirable loop compensation. The question of 
realizability can be postponed until the whole compensator, including indi- 
vidual loop shaping, has been designed; less adjustment may be necessary at 
that stage, since additional poles may have been added by then. 

Pseudo-diagonalization can be applied to either direct or inverse 
Nyquist arrays. (As usual, the roles of rows and columns are interchanged 
with inverse arrays.) But a practical difficulty arises if a dynamic compensator 
is found for an inverse array: its inverse needs to be realizable. Each row (of 
the inverse compensator) can be scaled to make the inverse realizable. One 
procedure is to ensure that each element of K (oo) is non-zero, obtain a state- 
space realization, and then obtain the realization of K(s), using the algorithm 
given in Section 8.3.4. 

It should be noted that, in our development of pseudo-diagonaliz- 
ation, we have not assumed that G and K are square. The technique can 
therefore be used with plants which have unequal numbers of inputs and 
outputs. 

Although we have described pseudo-diagonalization in the context of 
Nyquist array methods, it can clearly be applied whenever approximate 
inverses of frequency responses are required. In particular, it can be viewed as 
an extension of the ALIGN algorithm, and can therefore be applied in the 
context of the characteristic-locus method. 
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4.8 Design example 


4.8.1 The design 


(1) PLANT AND SPECIFICATION 


We shall use the same plant model and specification as we used for the 
characteristic-locus design example of Section 4.4. As the plant we take 
the three-input, three-output, five-state aircraft model AIRC, described in the 
Appendix, which is stable, with one pole at the origin, and has no finite 
transmission zeros. We aim to achieve a bandwidth of about 10rads~! for 
each loop, with little interaction between outputs, good damping of step 
responses, and zero steady-state error in response to step demands or 
disturbances, with a one-degree-of-freedom control structure. 


(2) OBTAINING COLUMN DOMINANCE 


The plant itself is not column dominant. Figure 4.23 shows the column 
dominance measure 


2 Igu(jo)| 


igo) (4.164) 


Gain (dB) 


107? 1072 107! 10° 10! 10? 
Frequency (rad s~!) 


Figure 4.23 Column dominances of the plant: column 1 (solid curve), column 2 
(dashed curve) and column 3 (dotted curve). 
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for each column, in the form of a Bode plot. Values below 0 dB indicate that a 
column is diagonally dominant; the more negative the value, the greater the 
degree of dominance. Figure 4.23 shows that column 1 is dominant up to 
about lrads~', but not at higher frequencies, while columns 2 and 3 are 
dominant beyond about 1 rads~', but not at lower frequencies. 

We use pseudo-diagonalization (the algorithm of Ford and Daly 
(1979)) to obtain column dominance, and apply it to one column at 
a time. For this purpose, and in the rest of this design example, all 
frequency responses are evaluated at a set of 50 frequency points, equally 
spaced on a logarithmic scale between 0.001 and 100rads~', except for a 
greater density of points in the region of 0.18 rads~* (chosen for particular 
emphasis because the plant has a very lightly damped pair of resonant poles 
at — 0.018 + 0.183). 

Applying pseudo-diagonalization to the first column and optimizing 
over constant compensator elements only, with uniform weighting on all 
frequencies, was not successful. Diagonal dominance was improved at low 
frequencies, where it was not needed, but remained almost unchanged at high 
frequencies. Frequencies above 0.1 rads~' were therefore weighted 10 times 
as much as lower frequencies, but dominance was still not achieved above 
1rads~'. The weighting was increased further to 100 times as much as at 
lower frequencies, and the optimization was performed over first-order 
dynamic terms of the form ko; +k,;s (where ko; and k; are column 
vectors—see equation (4.159)). This produced virtually perfect diagonal 
dominance, with a dominance measure less than 1076 at all frequencies. The 
designed first column of the compensator is (except for arbitrary scaling by a 
scalar transfer function) 


— 1.52 x 10735 — 9.78 x 1074 
k,(s)=| — 1.83 x 1074s +3.11 x 1075 (4.165) 
— 4.05 x 107? s — 1.31 x 1073 


The degree of dominance achieved here is necessarily great, and the 
practical consequence will certainly be very severe exertion by the control 
effectors—in particular, by the two control surfaces. Some further exper- 
imentation would therefore be worth while in a real design at this stage, to see 
whether a higher value of the dominance measure (say 107?) might not be 
obtained, possibly with a simpler compensator structure. Only one element of 
the column may need to be dynamic, for example. 

For the second column, it was again necessary to optimize over a first- 
order dynamic structure, and to use frequency weighting: frequencies below 
lrads~! were weighted 100 times as much as higher frequencies. Again, 
almost perfect dominance was obtained, with a dominance measure better 
than 2 x 1074 everywhere. The design obtained for the second column of the 
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Gain (dB) 
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Figure 4.24 Successive improvement of the dominance of column 3 (see text). 


compensator is 


Os — 1.22 x 107? 
k,(s)=| — 5.74 x 1072s — 4.50 x 1073 (4.166) 
Os — 3.40 x 107? 


Note that first-order structures were not needed for the first and third 
elements. The computed coefficients of s in these elements were of the order 
of 1076. 

The third column proved to be the most difficult to compensate. 
Optimizing over a first-order structure with high weighting on low 
frequencies (< 1rads~*) achieved dominance up to about lrads~', but 
lost it at higher frequencies (thus reversing the original dominance properties 
of this column)—see curve A in Figure 4.24. Reducing the weighting on the 
low frequencies gave no benefit at higher frequencies, so optimization over 
a second-order structure 


k,(s) = ko3 + kis + ky48? (4.167) 
was attempted, with uniform weighting at all frequencies. This achieved 


dominance everywhere, except in a narrow range of frequencies near 
0.2rads~' (which is where the plant resonance occurs) and gave poor 
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Figure 4.25 Column dominances after compensation: column 1 (solid curve), 
column 2 (dashed curve) and column 3 (dotted curve). 


dominance between 0.1 and lrads~‘~see curve B in Figure 4.24. The 
weighting on frequencies between 0.1 and 1 rad s~ 1 was therefore increased to 
10 times as much as on other frequencies, and this gave diagonal dominance 
at all frequencies, with a dominance measure less than 0.25 everywhere — see 
curve C in Figure 4.24. The design obtained for the third column of the 
compensator is 


4.21 x 1072s? — 9.78 x 1077s — 1.14 x 107? 
k3(s) =| 5.07 x 1073s? — 2.58 x 1077s — 3.34 x 1077 (4.168) 
2.86 x 107152 — 3.08 x 107?s — 1.52 x 107! 


Figure 4.25 shows the dominance measures obtained for each of the 
columns. We shall refer to the compensator designed so far as K,(s), for 
consistency with the notation used in Section 4.6. 

Note that 


K,(s) = [ky (s), k2(s), ka(s)] (4.169) 


where k,(s), k2(s) and k3(s) are defined by (4.165), (4.166) and (4.168), 
respectively. 
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(3) LOOP COMPENSATION 


The first two columns of G(jw)K, (jæ) are so dominant that the Gershgorin 
circles, superimposed on the (1, 1) and (2, 2) loci, cannot be distinguished from 
the loci themselves, and the compensation of the first two loops is no different 
from compensation of a SISO system. 

The frequency response of the (1, 1) element looks exactly like that of 
an integrator, and lies along the negative imaginary axis. Figure 4.26 shows 
the Bode magnitude response of this element. The only adjustment needed 
here is an increase in gain, in order to obtain the required bandwidth. An 
increase of 72 dB (i.e. a gain of 3980) brings the 0 dB cross-over frequency up 
to 8rads~*, and since the phase is — 90° everywhere, the closed-loop 
(— 3dB) bandwidth should be the same, or very slightly higher (see Figure 
1.15). Figure 4.26 also shows the gain characteristic after the gain has been 
increased. 

The frequency response of the (2,2) element, shown as a Nyquist plot 
in Figure 4.27, is essentially constant at — 25 dB (= 1/17.4) at all frequencies. 
This can be compensated easily, by first changing the sign to obtain a positive 
constant, and then inserting an integrator with enough gain to move the 0dB 
cross-over frequency to about 10 rads~*. Thus the compensation needed for 
the second loop is — 174/s. The result looks like a perfect integrator charac- 
teristic with the correct cross-over frequency (not shown). 

The response of the (3, 3) element is shown as a Nyquist plot in Figure 
4.28, together with its Gershgorin band (computed column-wise), and its 
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Figure 4.26 Response of (1, 1) element with and without compensation. 
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Figure 4.27 Response of (2, 2) element before compensation. 
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Figure 4.28 Response of (3, 3) element before compensation. 


DESIGN EXAMPLE 195 


Gain (dB) 


4 L f L L SS CS S O cs Se EE E 


1073 10°? 107? 10° 10! 10? 


Frequency (rad s~')} 


Figure 4.29 Gain of (3, 3) element before compensation. 


magnitude is shown in Bode form in Figure 4.29. Suitable compensation of 
this element is obtained by first changing its sign, so that the locus starts and 
ends on the positive real axis, and then inserting an integrator with enough 
gain to add about 10dB at 10rads~*. The compensator — 31.6/s achieves 
this, and the compensated magnitude characteristic is shown in Figure 4.30. 
For this element the thickness of the Gershgorin band is significant, so we 
should check that the band does not overlap — 1 in order to be sure that our 
inference of closed-loop stability is correct. Figure 4.31 shows the compen- 
sated (3, 3) element, together with its (column) Gershgorin band, in the form 
of a Nichols chart. This shows that the Gershgorin band is well clear of — 1 
(i.e. of the point (0dB, —180°)) and our design looks satisfactory. 

The three loop compensators we have designed at this stage can be 
assembled into the diagonal matrix 


3980 0 0 
174 
i Se ng 
K,(s) = 5 (4.170) 
31.6 
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Figure 4.30 Gain of (3, 3) element after compensation. 
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Figure 4.31 Response of (3,3) element after compensation, with Gershgorin 
band, on a Nichols chart. 
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The compensation is not yet finished, because the product K,(s)K,(s) is not 
realizable: each element in the first and third columns has one more zero than 
poles. We obtain a realizable compensator by inserting some more poles. For 
simplicity we multiply each element of K,(s) by 1/(0.05s + 1); this gives an 
additional phase lag of 26° at 10rads~‘, and so should not reduce stability 
margins significantly, while reducing the loop gain above 20 rads~’, thus 
reducing noise transmission. 


(4) REALIZATION OF THE COMPENSATOR 


With this modification, the complete compensator has the five-state realiz- 
ation (KA, KB, KC, KD), where 


| 0.0204 0.0898 —1.1977 0.0178 0.0834 
0.0299 —0.2042 2.0407 04695 —0.8313 
KA =| —0.3410 1.5291 — 19.8216 0.0360 += — 0.0628 
0.4771 1.2183 0.0969 —19.9707 —0.0531 
L 0.1436 —0.4167 —0.0517 —0.0098 —19.9831 
(4.171) 
0.0053 —0.0162 —0.0250 
0.0458 0.0794 —0.0411 
KB=| 0.0230 —0,9959 —0.0370 (4.172) 
—0.4720 —0.0392 0.8783 
0.8801 0.0010 0.4743 
0 0 25.7 592.1 2351.9 
KC=| 0 0 —1973 —774 297.5 (4.173) 
00 -02 193.8 7306.5 
—121.3929 0  —26.6305 
KD=| —145671 0  —3.2056 (4.174) 
— 322.1833 0 —180.7996 


4.8.2 Analysis of the design 


The characteristic loci of the design are shown in Figure 4.32, together with 
part of the ms 2 M-circle. These apparently indicate ample stability margins. 
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Figure 4.32 Characteristic loci, with 3dB M-circle. 


Figure 4.33 shows the largest and smallest principal gains of the 
closed-loop transfer function 
T(s) = G(s)K(s) [I + G(s)K(s)]~* 


where G(s) is the plant transfer function, and K(s) is the final compensator 
transfer function. These show that, at the output of the plant, the design 
specification has been achieved: the low-frequency gain of 0 dB indicates that 
constant demands will be tracked exactly (in the steady state), the small peak 
shows that all responses to changes in demanded values, or to disturbances at 
the output, will be well damped, and the — 3dB bandwidth is seen to be 
between 10 and 15rads~‘, depending on the signal direction. The time 
responses to step demands on each of the outputs are plotted in Figure 4.34 
and show excellent behaviour, interaction from the first two outputs being so 
small as to be invisible. 

In spite of the apparently excellent results shown in Figures 4.32-4.34, 
the design suffers from a serious flaw. The compensator has exactly cancelled 
the resonant pair of plant poles at — 0.018 + 0.182j, thus making these 
resonant modes uncontrollable from the compensator input and unobserv- 
able from the compensator output, leaving them as modes of the closed-loop 
system. This is apparent from examination of the eigenvalues of the matrix A 
of the closed-loop system, or of the transmission zeros of the compensator. 
These modes are not visible as long as we examine the transfer functions 
GK (I+GK)~‘ (Figure 4.33) or KG(I+KG)~* (whose largest and smallest 


(4.175) 
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Figure 4.33 Largest and smallest closed-loop principal gains. 
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Figure 4.34 Closed-loop step responses to step demand on output 1 (solid 
curves), output 2 (dashed curves) and output 3 (dash-dotted curves). 
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Figure 4.35 Largest and smallest principal gains at plant input. 


principal gains are shown in Figure 4.35). But they become apparent once we 
consider how any disturbances at the plant inputs may propagate to the plant 
outputs, in other words once we examine the transfer function SG. Figure 
4.36 shows the largest principal gain of this transfer function; a sharp 
resonant peak at 0.18 rad s7 is very prominent. The effect of this resonance is 
that all the controlled variables are liable to oscillate with a very slow decay 
(e~ 9-918"), and low frequency (0.18 rads~*, or 0.03 Hz), in response to a 
disturbance which affects the control inputs.* Figure 4.37 shows the time 
responses of the outputs after a unit impulse disturbance on the first input 
(spoiler angle). Initial large transients are removed quickly, as would be 
expected from the earlier figures, but the beginning of a slow subsequent 
oscillation is visible, particularly on output 1 (altitude). 

The existence of this resonance in the closed-loop behaviour does not 
necessarily invalidate the design. Its effect needs to be examined in the light of 
detailed knowledge of the specific application. Disturbances are likely to 
appear at the plant inputs, in this case in the form of friction and hysteresis at 
the control surfaces, and of minor variations in the engine thrust. But the 
likely magnitudes of such disturbances, and the consequent effects on the 
aircraft’s handling qualities, need to be assessed before the design is rejected. 

We can also examine the various principal-gain plots from the point of 
view of robustness assessment, in the face of (unstructured) perturbations. In 
terms of the analysis presented in Section 3.11.1, Figure 4.33 tells us that the 
system remains stable, provided any output multiplicative perturbations A, 
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Figure 4.36 Largest (closed-loop) principal gain from plant inputs to plant 
outputs. 
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Figure 4.37 Responses to impulse disturbance on plant input 1. 
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(such that G = (I + A,)Go, where Go is the nominal plant model) satisfy the 
condition || A,|],, < 1/1.12 (= — 1dB, since the peak principal gain value in 
Figure 4.33 is + 1 dB). Figure 4.35 shows that input multiplicative pertur- 
bations A; (where G=G,(I + A,)) do not cause instability, provided 
| A; ||. < 1/2.1 (= —6.4dB). (These assessments of stability robustness 
assume that no unstable poles are introduced by the perturbations.) These 
stability margins seem quite reasonable (89% relative uncertainty allowed at 
the output, 48% uncertainty allowed at the input), particularly when one 
recalls that these margins are conservative. 

Figure 4.36, however, shows that the largest stable feedback perturba- 
tion Ap, such that G = Go(I + ApGo) +, which can be guaranteed not to 
destabilize the loop has magnitude || A, ||, < 1/5.6 (= — 15dB). This allows 
only 18% relative uncertainty, which looks rather low. But this figure relates 
only to a narrow band of frequencies around 0.18rads~’, and it is 
legitimate to take the least conservative bound for a given perturbation at 
each frequency. In other words, the 18% figure need cause concern only if we 
can identify possible perturbations which can be represented by the use of Ar, 
but could not be represented by the use of either A, or A;. One such 
perturbation is the migration of the resonant pole pair — 0.018 + 0.182) 
across the stability boundary; whether such a perturbation could occur is 
again a question which cannot be answered here, since it requires detailed 
knowledge of the aerodynamics of the aircraft being modelled. In short, 
Figure 4.36 indicates that there is a possible robustness problem, but more 
detailed analysis is needed in order to evaluate its severity. Such analysis 
could usefully employ the ‘structured uncertainty’ approach taken in 
Section 3.11.2. 

The present design can also be criticized on the grounds that it 
assumes the availability of very large control signals — some of the elements in 
(4.173) and (4.174) have extremely large magnitudes. This problem is largely 
induced by the specification, which asks for a bandwidth which is probably 
unattainably high. The bandwidth specification is rather vague, so some 
refinement of the design to reduce the achieved bandwidth a little, and 
perhaps to reduce the high-frequency ‘roll-off’ rate further by inserting 
further lags in the controller, could be pursued, and may succeed in reducing 
short-term control-signal magnitudes by a factor of two or three. But to 
obtain a reduction of an order of magnitude, which is what is required in 
practice, would probably require abandoning the goal of a 10 rad s~ 1 band- 
width and settling for something like 1 rads’ instead. Some reduction of 
control-signal magnitudes may also be obtained by moving to a design witha 
much lower degree of diagonal dominance. 


4.8.3 Comparison with the characteristic-locus design 


The design produced by the characteristic-locus method, in Section 4.4, 
clearly differs in several respects from the DNA design produced in this 
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section. It is most important to emphasize, then, that these differences arise 
almost entirely from differences in the detailed execution of the design, and 
not from the choice of design technique. 

The fact that the DNA design places zeros on top of resonant poles, 
while the characteristic-locus design shifts all the poles to well-damped 
locations (— 4.02+7.79j, — 3.83 + 6.58j, — 3.16 + 0.23j, — 0.49 + 0.01j, 
— 9.52, — 0.53), is a consequence of the use of optimization over dynamic 
structures in the DNA design, which allows the zeros to be introduced, but 
optimization over only constant structures in the characteristic-locus design, 
which does not. Had we used pseudo-diagonalization to find the approxi- 
mate commutative controller, instead of the basic ALIGN algorithm, then 
such pole—zero cancellation may well have been introduced into the charac- 
teristic-locus design too. A major lesson to be learnt from the example, then, 
is that too free a use of optimization, with any of the techniques presented in 
this chapter, may be unproductive if there are unpleasant modes in the plant 
(this is particularly true if there are actually unstable modes in the plant, of 
course)., 

The facts that the spread of principal gains is larger in Figure 4.13 than 
in Figure 4.33, that the peak is larger, and that the ‘roll-off’ rate is only 
20 dB/decade rather than 40, are all results of specific decisions taken during 
the detailed design, and not of the choice of one design technique or another. 
Further work could be done on either design to make it appear very similar 
to the other. Of course, the fact that interaction is lower in the DNA design 
is again the result of more degrees of freedom having been used in the 
optimization, and is not an inherent advantage of using Nyquist-array 
techniques. 


4.9 Quantitative feedback theory 


The design methods which we have considered so far require design objec- 
tives to be stated in terms which are familiar from classical SISO design 
methods. In the design examples given in Sections 4.4 and 4.8 we have stated 
these requirements very imprecisely, but we could have been more specific 
about bandwidths, about allowed values of principal gains of various 
transfer-function matrices, and about permitted interactions. We could also 
have specified acceptable time-domain responses of the diagonal elements of 
the closed-loop transfer function. Some of these specifications may be derived 
from an accurate knowledge of noise and disturbance statistics, or of possible 
perturbations to the nominal plant model (by means of the methods de- 
scribed in Chapter 3), but they are more often obtained in a less quantitative 
manner. Typically they are initially obtained from previous experience with 
similar plants, and then refined in a specify—design—analyse cycle until the 
closed-loop behaviour is judged to be acceptable. 

This approach has been forcefully criticized as inadequate by 
Horowitz (1982). In Chapter 1 we pointed out that the basic reason for using 
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Figure 4.38 A feedback configuration with two degrees of freedom. 


feedback is to combat uncertainty, in the form of either unpredictable noises 
or disturbances, or unpredictable variations in the behaviour of the plant. If 
we have a quantitative description of the amount of uncertainty which may 
be present, and a precise specification of the range of behaviours which may 
be tolerated in the face of such uncertainty, then we should aim to develop a 
design technique which allows us to proceed systematically to satisfy this 
specification. 

One such technique which has been proposed is known as quantitative 
feedback theory (Horowitz, 1979, 1982; Horowitz and Sidi, 1980) often ab- 
breviated to QFT. (The H „ theory described in Chapter 6 is an alternative 
technique with the same aim.) The QFT approach assumes that the plant 
uncertainty is represented by a set of templates on the complex plane, each of 
which encloses within it all the possible frequency responses g;; (Jœ) from 
some input j to some output i at some frequency «,, as described in Section 
3.10.3. It also assumes that the design specification is in the form of bounds on 
the magnitudes of the elements of the frequency-response matrices T( jo) 
and/or S(ja); for example, 


a;;(@) < |t,(j@)| < b; (œ) (4.176) 


where a; (œ) and b; (œ) are real-valued functions of frequency. The QFT 
technique leads to a design which satisfies these specifications for all per- 
missible plant variations, while approximately minimizing the transmission 
of output sensor noise (if such a design is possible). 

The real performance specification is more likely to consist of bounds 
for closed-loop time responses to particular reference or disturbance signals, 
rather than frequency-domain bounds such as (4.176). Although relatively 
little is known about mapping time-domain bounds into frequency-domain 
bounds, Horowitz and Sidi (1972) have obtained sufficient conditions on 
frequency-domain bounds which imply the satisfaction of time-domain 
bounds; the frequency-domain bounds obtained in this way do not appear to 
be unduly conservative. 

At the heart of the QFT technique is a method for designing SISO 
systems. We shall describe this briefly, before going on to the multivariable 
problem. Figure 4.38 shows a feedback system with two degrees of freedom 
which is to be designed. For the purposes of this description we shall ignore 
disturbances and measurement noise, and assume that we are dealing with a 
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Figure 4.39 Nichols chart with template. 


stable, minimum-phase plant. Figure 4.39 shows a Nichols chart on which 
L(jw) = G(jw)K (jo) (4.177) 
will eventually be plotted. The broken curves are contours of 
{L[1 + L] +] = M (= const.) (4.178) 


namely the familiar M-circles (assuming a SISO system for now). The dashed- 
line rectangle is the template of all possible values of G(jw@,) at some 
frequency w,. Now, suppose that the design specification, at this frequency, is 


a(@,) <| T(ja,)| < b(@,) (4.179) 
where 7(s) is the transfer function 

T(s) = L(s)[1 + L(s)]~*P(s) (4.180) 
If we assume that we can implement the pre-filter P(s) with negligible 
uncertainty, which we generally can, then the variation in |7(ja,)|, as 
G(j@,) ranges over its possible values, will be the same as the variation in 


|L(j@,)[1 + L(j@,)]~*|. We therefore determine all possible positions on 
the Nichols chart to which the uncertainty template of G(jw,) could be 
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Figure 4.40 A set of template boundaries dictated by the specification, together 
with a satisfactory L(j@) locus. 


translated (without any distortion or change of size or orientation), such that 
it would not intersect any pair of M-circles whose values differed by more 
than |b(@,)|/|a(@,)|. The solid-line rectangle shows the template translated 
to one such position. This template is labelled { L(jw,)} since it shows the set 
of all possible values of L(ja@,) which could occur with one particular 
compensator K(s) (assuming that K(s) can be implemented with negligible 
uncertainty). The use of this compensator would reduce the variation in the 
closed-loop response to the required amount. (Note that at this stage we do 
not worry about the actual values attained by |L(1 + L)~*|, since later we 
can design P(s) so as to get the correct range of | T|.) 

We have shown just one possible template, {L(ja,)}, but an infinite 
set of such templates is possible, each of which reduces the variation of 
|L(1 + L)~*| to the required amount. This set has a boundary which corre- 
sponds to the curve labelled B(w) in Figure 4.39. The hatching shows the 
side on which values of L(j@,) may not lie. 

Repeating this construction for a set of frequencies {@,..., Op? 
generates a set of boundaries {B(@,),..., B(@,)}, as shown in Figure 4.40. 
Design now proceeds by finding a loop-gain function L(s) whose frequency 
response satisfies all these bounds. It can be shown that a solution exists (for 
minimum-phase plant) which is optimal in the sense that L(ja) lies on the 
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boundary B(w) for each œ, but this optimal solution may require a compen- 
sator K(s) of great complexity. Design trade-offs are generally made to obtain 
reasonably simple compensators, at the expense of having a larger loop gain 
(at some frequencies) than is required. A typical L(j@) locus is shown by the 
broken line in Figure 4.40. The choice of L(s) is restricted by the requirement 
that the compensator should be realizable, so |L(jq@)| must fall at least as 
quickly as |G(jq@)| at high frequencies, and of course the Nyquist stability 
criterion must be satisfied. The loop-gain function Z(s) must also satisfy 
Bode’s gain—phase relations, so arbitrary loci (on the Nichols chart) cannot 
be attained. 
Once K(s) has been designed, the inequalities 


a(w,)e,(@;) < | L(ja,)[1 + Llo) "| < b(@,)e(@) (4.181) 
will be satisfied for {@,,... , @,} for some c,(@,). In order to meet the design 
specifications 

a,(@;) < | T(ja;)| < b;(a;) (4.182) 


the pre-filter is chosen to have the gain behaviour 
| P(ja,)| = Lelo] (4.183) 


If specifications also exist in the form of bounds on the sensitivity 
function 


S(s) = [1 + L(s)] t (4.184) 


then a similar approach is used to meet these, and a combination of 
specifications on both T(s) and S(s) can be handled. (Note that T is not the 
complementary sensitivity in this subsection, and that T and S are not 
completely determined by each other.) 

This design method is extended to multivariable problems as follows. 
Suppose that the design specifications are of the form given in (4.176), where 
t;;(s) is the (i, j) element of the closed-loop transfer function T(s). From 
Figure 4.38, we have 


(1+ GK)y = GKPr (4.185) 
or, if G is square 
(G+K)y =KPr (4.186) 


where we have used G to denote G~', as before. We shall consider t,,,, namely 
the transfer function from the vth input to the uth output. If r,; =O forj #0, 
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diy (s) 


Figure 4.41 Interpretation of #,, (equation (4.191)) as the output of a feedback 
system. 


then the uth element of the vector KPr is given by 


(KPr), = (KP),,1, (4.187) 
= » kuaiPioto (4.188) 

The uth element of the vector (G + K)y is given by 
([G + K]y), = 3 (Gur + kur)tisto (4.189) 


since y; = typ. 

Now we impose the constraint that k,; = 0 for i # j, namely that the 
compensator K is diagonal. Then, equating (4.188) with (4.189) (because of 
(4.186)), we obtain 


Fy » Gutiy F ri (Guu T kuu)tuv = kuuPuvtv (4.190) 
l#u 


If we now define h;; = 1/4;;, then (4.190) can be rewritten as 


huu KkuuPuv hissy 


= Z 4.191 
ST ha ee 


where 


eee (4.192) 


The expression for t„, has been written in this way because it now shows t,,, as 
the output of the SISO system shown in Figure 4.41, when the reference input 
is an impulse, and the disturbance (at the ‘plant’ input) is — dup- 

The idea now is to design k,,, and p,,, using the technique already 
developed for SISO systems. The term d,,, is not initially known, since it 
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depends on the elements {t,,: | 4 u}, which themselves depend on the details 
of the design. If it is assumed that the specification (4.176) is eventually 
attained, however, then bounds can be obtained on the possible variation of 
d,,, which allows the design of k,, and p,, to proceed. This seems like a 
dangerously circular argument, but it can in fact be shown to be sound. Let us 
write (4.191) as 


luy = Tuy — Zaipdar (4.193) 
and suppose that we can find k,,, and p,,, such that 


|tu (j@)| ar | Tauv(J@)| duve(@) > Q,,(@) > 0 (4.194) 


and 
|Tu(j@)| =F lTauv(J@)|d,,.(@) < byy(@) (4.195) 


where dave is an upper bound on all possible values of |d,,,,| which may occur 
for all possible plants 


pitas ee 


Gow) Fu | Ay (J@)| ia 
Suppose also that (4.194) and (4.195) hold for all values of u and v—that is, 
suppose that m compensator elements k,,,(s) and m? pre-filter elements p,,,(s) 
have been found for which these inequalities hold. Then a result from 
functional analysis (Schauder’s fixed-point theorem) can be used to show that 
(4.176) is also satisfied, and so the design specification will have been met. 
A noteworthy feature of the QFT approach to multivariable design is 
that the compensator K(s) has only m elements -only the diagonal elements 
are designed, and all the off-diagonal elements are fixed at zero. Each 
diagonal element k;(s) appears in the expressions for the m elements t,,(s) of 


the transfer function T(j = 1, ..., m)~that is, it appears in m separate SISO 
design problems. Each of these problems generates a set of boundaries 
{B,(@,),..., B@,)} (j = 1,..., m) which must be achieved by the loop- 


gain function h,;(jw)k,;(j@), and the compensator k,;(s) must be chosen to 
satisfy the most stringent of these. 

This strategy recognizes that the use of cross-couplings inside the 
feedback loop, for the purpose of controlling interaction, is inherently fragile 
if there is significant uncertainty about the plant model. Loop gains are used 
to reduce the amount of uncertainty. Once it has been reduced to an 
acceptable amount, then cross-coupling is used outside the loop, in the pre- 
filter P(s), to obtain the required pattern of interactions. 

As with the sequential loop-closing approach (see Section 4.2), success 


in designing a suitable K(s) may depend on finding the best pairing of plant 
inputs with outputs. 
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We have given a very brief description of the QFT approach to 
feedback design, and have outlined only the simplest version of it. The theory 
has been elaborated to allow the use of off-diagonal elements in K(s), the 
specification of responses to disturbances, trade-off between the bandwidths 
of various loops, and has even been extended to non-linear problems. It is 
rather complex, but much of the complexity can be encapsulated in computer 
software. 


4.10 Control-structure design 


For the design techniques presented in this chapter, it has been assumed 
that the manipulated and controlled variables (plant inputs and outputs) 
have already been chosen. In fact, an important part of multivariable design 
(particularly in the process industries) is the choice of those variables which it 
is most appropriate to control, and the choice of those variables which can 
be manipulated most effectively. Once such a choice has been made, it is 
frequently necessary to choose appropriate pairs of inputs and outputs -— that 
is, to assign each input, or set of inputs, to the control of one particular output 
or set of outputs. This is known as the input/output assignment or pairing 
problem. 

If a fully cross-coupled multivariable controller is to be used, then the 
pairing of inputs and outputs is unimportant since the cross-couplings can 
distribute control actions between inputs and outputs in the most effective 
way. But it is frequently necessary to use decentralized control, in which either 
no cross-couplings are permitted (the compensator transfer function being 
constrained to be diagonal) or the cross-coupling between loops is to be 
minimized (generally resulting in a block-diagonal compensator transfer 
function). In this case finding a suitable pairing is of great importance; if an 
input were to be paired with an output over which it had little or no influence, 
then satisfactory control would clearly be unattainable. Finding a good 
pairing of input and output variables is also important when decentralized 
control is dictated by the design technique, rather than by implementation 
requirements. For example, both the sequential loop-closing approach to 
multivariable design, and the version of the QFT approach presented in the 
previous section, can be made easier by a good initial pairing of inputs with 
outputs. 

Problems such as choosing the input and output variables, and 
deciding how to pair them, are problems of control-structure design. In spite 
of their supreme importance, very little progress has been made towards 
solving such problems systematically. Some promising advances have re- 
cently been made, however, and we shall summarize them in this section. 

An analytical tool for attacking the control-structure problem, which 
has been available for over twenty years, is the relative-gain array, or RGA 
(Bristol, 1966). Suppose that the transfer function from input j to output i of a 
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plant is g;; when all loops are open, and h;; when ail outputs except the ith 
output are tightly controlled. Then the (i, j) element of the RGA T is 


Yy = - (4.197) 


This element can be evaluated from the open-loop transfer function of the 
plant, as follows. Suppose that a change Au; in the (transform of the) jth input 
causes a change Ay, in the kth output. Then 


Au; = 3 Gx AVe (4.198) 


where G = G71, as before. (We assume that the plant is square.) But if all the 
loops except the ith are tightly controlled, then Ay, = 0 for all k # i. Thus 
(4.198) simplifies to 


Au; = gi Ay; (4.199) 
from which we obtain 


hj = £ (4.200) 
Iji 


and (4.197) becomes 
Yy = JiyÂj (4.201) 
In terms of matrix operations, the relative gain array is defined by 


T (s) = G(s). * G"(s) (4.202) 
where ‘*.*” denotes element-by-element multiplication (the Schur or 
Hadamard product). 

The assumption that all the other loops are perfectly controlled is of 
course highly artificial unless it is restricted to a specified range of frequencies. 
In fact the RGA is almost invariably defined in terms of steady-state gain 
information only—namely, for s = 0. From now on we shall use the term 
‘relative gain array’ (or RGA) to mean T (0) only, except where specified 
otherwise. There are two reasons for this restriction. First, if integral control 
is used then the assumption of perfect control at zero frequency is entirely 
realistic, and integral control is used very commonly. Secondly, the RGA 
has been used most widely in the process industries; in these industries it is 
often difficult to obtain reliable dynamic models, whereas steady-state gain 
information is readily available. 
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The following properties of the RGA can be derived easily: 


Theorem 4.5: The mxm RGA I(s) = G(s). *G"(s), with elements 
{yy} has the following properties: 


(1) y= ÈZ vy=1 
j=l i=1 


(2) Permutations of the rows or columns of G(s) result in the same 
permutations of I (s). 


(3) Input or output scaling leaves I (s) unchanged. That is, if S 
and T are diagonal matrices, G,(s)=SG(s)T, and T3(s)= 
G,(s).* G3(s), then ',(s) =T(s). 

(4) If G(s) is triangular (and hence also if it is diagonal), then 
T(s)=]1. 


Proof: The proof is left as an exercise for the reader. 


The RGA was introduced as a measure of interaction which could be 
used both to select those input and output variables to be used for control, 
and tọ decide on how to pair them, if a decentralized control structure were 
required. It has been used widely and successfully in the process industries 
(Shinskey, 1979, 1981; McAvoy, 1983), but until recently virtually no theoreti- 
cal explanation of its utility has been available. It has been known, for 
example, that choosing variables which result in large or negative elements in 
the RGA leads to difficulties in controlling the plant, and that input and 
output variables should be paired so that the diagonal elements of the RGA 
are as close as possible to unity. 

Significant progress towards providing such an explanation has been 
made by Grosdidier et al. (1985), Nett and Manousiouthakis (1987), and 
Skogestad and Morari (1987). Of their results, we shall summarize those most 
relevant to the control-structure design problem, but we shall omit proofs 
and derivations. Not all of these results involve the RGA; some are formu- 
lated in terms of the steady-state gain matrix G(0). Most of the results relate 
to the feedback loop shown in Figure 4.42, in which the compensator has the 
structure 


K(s)= Eels) (4.203) 


so that integral action in each loop is assumed, and G(s)C(s) is proper (that 
is, || G(co)C(co)|| <). We shall assume that G(s) has m inputs and m 
outputs, and that T (0) denotes its RGA. 

The first result is the following. Suppose that C(s) is diagonal (de- 
centralized control), G(s) is stable, and the closed-loop system remains stable 
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R 
Figure 4.42 Feedback loop with compensator of structure K(s) =—C(s). 
S 


if any m— 1 feedback loops are opened. Then the system is unstable with all 
loops closed, for any k > 0, if (and only if when m < 2) 


det [G(0)] | Tl g;(0) < 0 (4.204) 


This condition may be met by some input/output pairings but violated by 
others, since interchanging any pair of rows or columns in G(s) changes the 
sign of det G(0)] and puts two new elements onto the principal diagonal. 

The next result again assumes C(s) to be diagonal: if y,,(0) < 0 for any 
i, then for any k>0 the closed loop has at least one of the following 
properties: 


è itis unstable 
e loop iis unstable when all the other loops are opened 


e itis unstable when loop i is opened 


So if the RGA has any negative entries on its principal diagonal, and integral 
action is employed, then it is impossible to obtain complete integrity (stability 
in the presence of loop failures) as long as a completely decentralized control 
structure is used. It is also impossible to design each loop independently in 
this case. 

If m = 2 then it is always possible to obtain y;(0) > 0 by a suitable 
pairing of inputs and outputs, but if m > 3 this may not be so. In such a case 
the use of a cross-coupled controller is strongly indicated. 


EXAMPLE 4.6 


If 
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then 
— 1.89 3.59 —0.7 


T(O)=| —0.13 3.02 —1.89 
3.02 —5.61 3.59 


and it is impossible to make y,,(0) > 0 for i = 1, 2, 3 by any permutation of 
rows or columns. 


We shall say that the system Q(s) = G(s)C(s) is integral-controllable° 
if the system shown in Figure 4.42 is stable for any k such that 0 < k < k* (for 
some k* > 0), and has zero steady-state error (the latter condition rules out 
cancellation of the integrators by zeros). Systems which are integral- 
controllable can be tuned on-line, turning the gain up gradually from zero. 

Two more results are known on integrity in the face of loop failures. 
If m = 2 and y;(0) > 0 (i= 1, 2), then there exists a diagonal C(s) such that 
Q(s) = G(s)C(s) is integral-controllable, and each loop remains integral- 
controllable if the other loop fails. If m= 3, y,(0)>0 (i=1, 2, 3) and a 
diagonal C(s) is such that Q(s) is integral-controllable and q,;(0) > 0, then the 
failure of any one loop leaves the other loops still integral-controllable. 

The next result shifts the emphasis to attainable performance rather 
than stability. If g(s) has no zeros in the right half-plane, Q(s) is strictly 
proper (|q,;(00)| =), |gx:(s)| approaches zero at least as fast as |g;(s)| for 
large |s| and all k, J, then h,,(s) has either right half-plane zeros or poles if 
y,;(0) < 0, where h,,(s) is the transfer function from u; to y; when all the other 
loops are closed with integral control. This result does not give the location of 
any right half-plane zeros; in particular cases these may turn out to be quite 
harmless for the design of the ith loop, but it certainly reinforces the belief 
that negative diagonal elements of the RGA are to be avoided if possible. 

The last batch of results about the RGA concerns the robustness of a 
design to modelling errors or changes in the plant’s behaviour. The idea is 
that large entries in the RGA indicate a plant which is poorly conditioned 
with respect to inversion. If the controller for such a plant is intended to 
invert the plant’s transfer function at some frequencies, in order to reduce 
interactions, then its performance (and even its stability) is likely to be very 
sensitive to changes in the behaviour of the plant. Examples of such con- 
trollers are those designed by the ALIGN algorithm or by pseudo- 
diagonalization. 

The basic result is: 


Theorem 4.6: 


OO eg OO (4.205) 


A ij 
Iji Jij 
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where G* is the cofactor of g;;. Hence 


Inĝ; = InG} — Indet(G) 


= In Gi ms inf aor 
k 


Differentiating, 
1 0G ji = Gi 
G5 OG; det (G) 
Te Ôj 
and therefore 
6G ji > 9; 
= ijji 2 
ji ij 
093; 
Tom Vay 7 a 
Jij 


This theorem shows that a change in g,, results in a much larger relative 
change in ĝ;, if |y; | > 1. It is not surprising, then, that large elements in the 
RGA can be related to large values of the condition number 


a(G) 


cond(G) = Z(G) 


(4.206) 


(which is discussed in some detail in Chapter 8). The condition number 
depends on the scaling employed, whereas the RGA does not, so we define the 
optimal condition number cond*(G) to be 


cond*(G) = min cond (SGT) (4.207) 
8,T 
where S and T are diagonal matrices. If we also define the 1-norm 


ITI, = max ¥ I (4.208) 
j i= 
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then the following result is available. If m = 2, then 
cond*(G) = |T + /(IP I ~1) (4.209) 


It is conjectured that, for any m, 


1 
i a F < : 
>. lul sond®(G) < cond*() a (4.210) 


This result is known to hold for m = 2. 

These and similar results can be used to show that, if the RGAs of 
both the plant and the controller have large elements, then the closed-loop 
system has little stability robustness in the face of uncorrelated perturbations 
at the plant input (that is, if the plant transfer function is G = G(T + A), 
where Gy is the nominal model, and A is diagonal). Such perturbations are 
almost always present, because of uncertainty about the exact behaviour of 
control actuators. If K(s) approximately inverts G(s), then the RGA of K(s) 
will be approximately the same as that of G(s). So, if the plant’s RGA 
contains large elements, an inverting (decoupling) compensator should not be 
used. A diagonal compensator has an RGA with no element larger than 1, 
and may therefore be able to provide reasonable stability robustness even if 
the plant’s RGA has large elements. It is unlikely to produce good per- 
formance in the face of disturbances, however, since it cannot remove 
interaction. (Good performance in response to reference or set-point changes 
may still be obtained if a control structure with two degrees of freedom is 
used, as in the QFT design method.) One can conclude, then, that large 
elements in the plant’s RGA certainly indicate a plant which is difficult to 
control. 

It should be noted that all the results presented from Theorem 4.6 
onwards hold at any frequency, and there is no need to restrict them to s = 0. 
In fact, the RGA of the plant should be inspected at frequencies up to the 
largest 0dB cross-over frequency required, and a little higher, to check for 
inherent sensitivity problems. 

The RGA is the major analytical tool developed specifically for the 
control-structure design problem, but other information is available which 
can be used to make decisions about the structure.® 

A Nyquist array, for example, may sometimes provide a quick visual 
indication of how a plant may be subdivided into several, almost independent 
sub-processes. Principal-gain plots for various blocks of the plant’s transfer- 
function matrix may indicate which inputs are the most effective for con- 
trolling particular outputs, and may also indicate whether there is an obvious 
decomposition of the plant into ‘fast’ and ‘slow’ sub-plants. 

If m> 3, then the number of possible control structures becomes 
extremely large if one considers all possible decentralized schemes — that is, all 
possible ways of dividing an m-input, m-output plant into p subsystems, each 
of which has m; inputs and outputs, with )\?_ , m; = m. Even if one takes each 
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m; = 1 (i.e. completely decentralized) there are still m! possible input/output 
pairings to try, and the number of all possible schemes is much larger than 
this. For example, with m = 4 there are 131 possibilities, and with m = 5 there 
are 1496. In practice it is impossible to try more than a small fraction of all the 
possible combinations ‘at random’, and there is as yet no systematic way of 
searching through all the possibilities. The RGA helps a little in this respect, 
but the main guidance towards choosing an appropriate structure must come 
from a physical understanding of the plant. 


SUMMARY 


In this chapter we have presented several approaches to multivariable 
design, all of which generalize, in one way or another, the classical tech- 
niques of feedback design associated with the names of Nyquist and Bode. 
Each approach ultimately replaces a multivariable problem by a set of SISO 
design problems. In the sequential loop-closing approach, the Nyquist-array 
approach and the characteristic-locus approach there is a preliminary stage 
in which the aim is to reduce high-frequency interaction and make sub- 
sequent design steps easier. 

In each of these methods, this preliminary decoupling stage can be 
aided by use of the ALIGN algorithm, or one of its generalizations, which 
allows the system to be decoupled across a range of frequencies by either a 
constant or a dynamic compensator. If a dynamic compensator is obtained 
automatically in this way, however, there is a danger that it may have zeros 
which cancel unstable (or very lightly damped) poles of the plant. 

If a precise specification of both the model uncertainty and per- 
missible closed-loop behaviour is available, then the use of the quantitative 
feedback theory should be considered. This is more complicated than the 
other approaches, but it is in many ways more systematic and more 
powerful. 

Before embarking on a multivariable feedback design, one should 
always consider carefully the design of the controller structure. Is it 
possible to divide the problem into smaller multivariable problems, or into a 
mixture of SISO and multivariable problems, or, best of all, into a set of SISO 
problems only? The answer depends not only on the plant, of course, but 
also on the design specifications. Is it possible to choose a better set of input 
and output variables, or a better pairing of the existing set? The relative-gain 
array gives some help in answering such questions, but this important 
problem area of multivariable design remains largely unsolved. 

Almost all of the material in this chapter can be applied to plant 
described by irrational transfer functions —in particular, to plant containing 
time delays. The ‘input’ to most of the techniques described is the fre- 


quency response of the plant, together with knowledge of any unstable 
poles. 
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Notes 


1. Much of the research literature contains similar suggestions, based on system 
characteristics at arbitrarily high frequencies. Such suggestions ignore the 
facts that real control systems are subject to power limitations, which manifest 
themselves as bandwidth limitations, and that models are valid over limited 
frequency ranges only. 


Ford (1985) has given some bounds for the extent of interaction, in terms of 
direct Nyquist arrays; these do not help to predict the diagonal elements of the 
closed-loop response, however. 


Ford and Daly (1979) also include negative powers of s in k,(s). But since each 
column can be scaled by an arbitrary scalar function, equation (4.159) is 
sufficiently general. 

This example illustrates nicely the point of Definition 2.7: that stability 
of a feedback system requires each of the transfer functions GK(I + GK)“, 
KG(I + KGY t, G(I + KG)! and K(I + GK) ' to be stable. In this case 
they are all stable; however, if we defined a generalized notion of stability (see 
Section 2.11) to mean that ‘all poles must have a damping factor of at least 
0.2’, then our design would be unstable in this sense, but only one of the four 
transfer functions would exhibit the instability. 

The concept of integral controllability was introduced by Grosdidier et al. 
(1985). 

Other tools introduced recently to help attack the control-structure design 
problem are the block relative-gain array (Manousiouthakis et al., 1986) and 
the structured singular-value interaction measure (Grosdidier and Morari, 
1986). 
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EXERCISES 


4.1 Repeat the characteristic locus design exercise performed on the AIRC 
model (see the Appendix) in Section 4.4, but aim for a closed-loop 
bandwidth of about 3 rads~?. 


4.2 Design a compensator for the TGEN model (see the Appendix), 
assuming a control structure with one degree of freedom, and aiming 
for a closed-loop bandwidth of about 0.5 rads” +, using one or more of 
the following methods: 


(a) sequential loop closing 
(b) characteristic locus 
(c) direct Nyquist array 


The sensitivity should not exceed 0.1 for œ < 0.05 (for any signal 
direction), and the output-sensor noise amplification should not 
exceed 0.002 for œ > 50 (for any signal direction). 
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4.6 


4.7 
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(a) A SISO negative-feedback system has a forward-path transfer 
function I(s) and a feedback transfer function f(s). Its closed- 
loop transfer function is h(s). Show that 


1 1 
ns) Hs) 7 
(b) Show that the relationships indicated in Figure 4.17 hold for 
the (SISO) feedback loop shown in Figure 4.18. 


Prove Theorem 2.12, and prove also that the degrees of row dominance 
and column dominance of XGX~', when X is chosen optimally, 
are the same. 


Refer to Example 4.5, with the system defined by equation (4.150). 


(a) Using suitable software, display a Nyquist array of G(ja), and 
check that it is not column dominant. Compute the appro- 
priate Perron—Frobenius eigenvalues and eigenvectors and 
check the correctness of Figures 4.20, 4.21 and 4.22. 


(b) Using the INA technique, design further compensation for 
each loop which will give a closed-loop bandwidth of about 
10rads~* and well-damped responses. Include integral action 
in your compensator. 


(c) Investigate whether column dominance of KG(jw) can be 
obtained using a constant, diagonal K. 


(d) Investigate whether (a) and (b) can be repeated using the DNA 
method instead of the INA method. 


eel 


find a diagonal matrix K, with positive elements, such that GK is row- 
dominant. 


Given 


Prove the properties of the RGA listed in Theorem 4.5. 


Show that the plant 


G(s) = 
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cannot be stabilized by a fully decentralized compensator of the form 


= k,(s) 0 
K)=| 0 Ba 


Does the relative gain array indicate the presence of this difficulty? 
(The pole at + 1 is known as a ‘fixed mode’ in the literature on 
decentralized control.) 


4.9 Examine the RGA for the TGEN model (see the Appendix). Comment 
on its implications, if a compensator is to be designed to give a closed- 
loop bandwidth of about 0.5 rad s™+. 


4.10 Examine the possibility of using decentralized control for the AIRC 
model (see the Appendix). Do not assume that the same speed of 
response is required for each controlled output. 
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CHAPTER 5 
Multivariable Design: 
LQG Methods 


5.1 Introduction 5.6 Shaping the principal gains 

5.2 The solution of the LQG 5.7 Some practical considerations 
problem 5.8 Design example 

5.3 Performance and robustness of 5.9 Non-minimum-phase plant 
optimal state feedback Summary 

5.4 Loop transfer recovery (LTR) Exercises 

5.5 Design procedure for square References 
plant 


5.1 Introduction 


In this chapter we shall begin examining design methods based on optimal 
control theory. The particular theory we shall be concerned with here is that 
of the so-called linear quadratic Gaussian or LQG problem. We shall state the 
main results of this theory, but for fuller details we refer the reader to 
Kwakernaak and Sivan (1972), Anderson and Moore (1972), Davis and 
Vinter (1985), IEEE (1971), Åström and Wittenmark (1984), Franklin and 
Powell (1980), Banks (1986); many others could have been mentioned. These 
sources establish the underlying theory, whereas here we shall be concerned 
with developing a design method based on the theory. This method is one 
which allows the designer to shape the principal gains of the return ratio, at 
either the input or the output of the plant, to achieve required performance or 
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robustness specifications. Stability is obtained automatically, so it is not 
necessary to worry about the phase characteristics—as we shall see, the 
characteristic loci do not need to be examined. 

The problem addressed by the theory is the following. Suppose that 
we have a plant model in state-space form, 


x= Ax + Bu+Tw (5.1) 

y=Cx+o (5.2) 
where w and v are ‘white noises’, namely zero-mean Gaussian stochastic 
processes which are uncorrelated in time (but may be correlated with each 
other), having covariances 


E{ww'}=W20,  Efwl™=V>0 (5.3) 


We shall assume in this chapter that w and v are in fact uncorrelated with 
each other, namely that 


E{wo™} =0 (5.4) 
In (5.1) u represents the vector of control signals; in (5.2) y is the vector of 


measured (but corrupted by v) outputs. The problem is then to devise a 
feedback-control law which minimizes the ‘cost’ 


T 
J = lim ey | (z'Qz + uRupat (5.5) 
T> © 0 : 
where 
z= Mx (5.6) 


is some linear combination of the states, and 
Q=Q™>0, R=R™>0 6.7) 


are weighting matrices. 

The solution to the LQG problem is prescribed by the separation 
principle (or ‘certainty equivalence’, in the literature on econometrics), which 
states that the optimal result is achieved by adopting the following procedure. 
First, obtain an optimal estimate % of the state x (optimal in the sense that 
E{(x — $)" (x — %)} is minimized), and then use this estimate as if it were an 
exact measurement of the state to solve the deterministic linear quadratic 
control problem. The point of this procedure is that it reduces the problem to 
two sub-problems, the solutions to which are known.! 
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Figure 5.1 The Kalman filter. 


The solution to the first sub-problem, that of estimating the state, is 
given by Kalman-filter theory. Figure 5.1 shows the block diagram of a 
Kalman filter, which is seen to have the structure of a state observer; it is 
distinguished from other observers by the choice of the gain matrix Kç, a 
formula for which will be given in the next section. Note that the inputs to the 
Kalman filter are the plant input and output vectors, u and y, and that its 
output is the state estimate vector Å. 

The second sub-problem is to find the control signal which will 
minimize the (deterministic) cost 


i (z'Qz + u" Ru)dt (5.8) 


on the assumption that 
ž = Ax + Bu (5.9) 


The solution to this is to let the control signal u be a linear function of the 
state: 


u= —K,x (5.10) 


where the state-feedback matrix K, is given according to the formula which 
appears in the next section. 

Now, if the control problem to be solved can really be represented by 
equations (5.1)-(5.7)—that is, if only well-defined stochastic processes are 
acting on the plant and measurements, the concern is to minimize the cost 
function (5.5), and the available linear model is reliable, then there is little 
more to be said. Usually, though, there are the familiar aspects of the control 
problem that are not captured by this mathematical formulation. There 
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Figure 5.2 The LQG compensator structure. 


are model uncertainties, non-linearities, various kinds of disturbance and 
possibly many constraints on realistic solutions, none of which can easily be 
given a mathematical representation. In this case one may still be interested 
in using the LQG theory as a method for synthesizing controllers, but with 
the matrices W, V, Q and R which appear in the problem formulation 
considered as ‘tuning parameters’ which are to be adjusted until a satisfactory 
design is obtained, rather than as representations of aspects of the real 
problem. 

This is the view we shall take in this chapter. From the perspective of 
the design problem as we have formulated it in earlier chapters, we are simply 
designing a compensator with a particular internal structure. This structure is 
the series connection of a Kalman filter with a state-feedback matrix; a block 
diagram of it is shown in Figure 5.2. 

To judge the suitability of a candidate design with this structure, we 
shall use methods introduced in earlier chapters. In particular, we shall make 
much use of principal gains. 


5.2 The solution of the LQG problem 
The optimal state-feedback matrix K, is given by 


K.= RB" P. (5.11) 
where P, satisfies the algebraic Riccati equation 


ATP, + P,A — P,BR7'B™P, + M™QM =0 (5.12) 
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and P, = PT > 0. (In general there are many solutions to (5.12), but only one 
of them is positive-semidefinite.) 
The Kalman-filter gain matrix K; is given by 


K; = PCV! (5.13) 
where P, satisfies another algebraic Riccati equation (dual to (5.12)): 
P,A’ + AP; — PCTV ICP +T WI? =0 (5.14) 


and 


The matrices K, and K; exist, and the closed-loop system is internally 
stable, provided the systems with state-space realizations (A, B, Q"? M) and 
(A, T W+/?, C) are stabilizable and detectable — namely, provided any un- 
controllable or unobservable modes are asymptotically stable. 

Several algorithms are available for solving (5.12) and (5.14). Here we 
give one put forward by MacFarlane (1963) and Potter (1966), which has the 
advantage of requiring only linear algebra. More sophisticated versions, with 
improved numerical properties, are described in Section 8.3.10. 

Form the Hamiltonian matrix (of dimensions 2n x 2n, if n is the 
number of states) 


A — BR~1B" 
Bel aou “at | (5.15) 


which has the property that if 4 is an eigenvalue, then so is — å (with the same 
multiplicity). Let U be the matrix of eigenvectors of H, ordered so that the 
left-most n columns correspond to eigenvalues with negative real parts, and 
the right-most n columns therefore correspond to eigenvalues with positive 
real parts. (If (A, B, Q'/?M) is minimal, H has no eigenvalues on the 
imaginary axis.) Now partition U into n x n blocks: 


U= Pa a (5.16) 
Us, Uz2 


The solution to (5.12) is then given by 
P, = U2, Uy} (5.17) 
To solve the dual equation (5.14), replace A by AT, B by C7, R by V 


and M'QM by TWT", and apply the same algorithm. There will be no 
eigenvalues on the imaginary axis if (A, W™?, C) is minimal. 
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If we substitute (5.10) into (5.9) we obtain 
x =(A—BK,)x (5.18) 


as the equation of the closed-loop system which would result if full-state 
optimal feedback were implemented. The eigenvalues of A — BK, can be 
shown to be the (open) left half-plane eigenvalues of H, which shows the 
hypothetical state-feedback scheme to be asymptotically stable. 

From Figure 5.1, the state equation of the Kalman filter is seen to be 


$ =(A—K,;C) + But Key (5.19) 


=| & 


The eigenvalues of A — K;C can be shown to be the left half-plane eigen- 
values of the Hamiltonian matrix for the filter problem, which shows the 
Kalman filter to be an asymptotically stable observer. 

Now, the equation of the combined Kalman-filter/optimal state- 
feedback scheme is 


d|x A —BK x Tw 
— = : 5.20 
ol | Ee pee a 629) 


If we define ¢ as the state estimation error, 


A 


g=x-xX (5.21) 


then (5.20) can be rewritten as 


d| x A—BK, BK, x Tw 
skl- 0 je a eee nee) 


which shows that the closed-loop eigenvalues of the LQG-compensated plant 
are just the union of the eigenvalues of the optimal state-feedback scheme 
with those of the Kalman filter. The overall scheme is therefore internally 
stable (under the stated assumptions). 


5.3 Performance and robustness of optimal 
state feedback 


Suppose that all the states of the plant are available for feeding back through 
K., so that we can dispense with the Kalman filter, as shown in Figure 5.3. 
The return ratio at the input to the plant is then 


—H,(s) = —K,(sI— A)’ B (5.23) 
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Figure 5.3 Hypothetical optimal full state-feedback controller. 


so that the return difference at the input is 

F.(s)= I+ K.(sI— Ay 'B (5.24) 
By adding and subtracting sP, to (5.12), we obtain 

—(—sI — A")P,— P.(sI — A)— P,BR~'B'P,+ MTOM =0 (5.25) 


Now, multiplying on the left by B'(—sI—A™)~' and on the right by 
(sI — A)"'B gives 


— BTP (sI — A) +B — B™(—sI — A")~1 P,B — B™(—sI — AT)“ 
x P,BR~! BTP (sI — A)"1B + B1(—sI — A™)~! 


x M™QM(sI — A)"'B =0 (5.26) 
or 
— RH,(s)— H}(—s)R— Hi (—s)RH.(s) + Ga (—s)QG,(s) = 0 
(5.27) 
where 
G,(s) = M(sI — A)7*B (5.28) 


The first three terms in (5.27) can be rewritten as R — [I + H12(—s)]R[I 
+ H,(s)], so, remembering that F,(s) = I + H,(s), we can rewrite (5.27) as 


F¢(—s)RF,(s) = R + Gi(—3)QG,(s) (5.29) 
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(Note that G,{s) is not the plant’s transfer function unless M = C. Equa- 
tion (5.29) was first derived by MacFarlane (1970).) 
From (5.29) it is clear that 


F! (jo)RF.(j@) >R (5.30) 
Safonov and Athans (1977) have shown that, if 

R=pl (5.31) 
so that 

FE (j@)F.(j@) >I (5.32) 
then there is at least 60° of phase margin in each input channel (that is, pure 
phase changes of 60° can be tolerated in each input channel simultaneously 
without losing stability), and infinite gain margin (in the classical sense: the 
gain in each channel can be increased indefinitely without losing stability). 
The loop may be conditionally stable, but it has a margin of at least 6dB 
against gain reductions. Furthermore, it follows immediately from (5.32) that 

o(F .(ja)) > 1 (5.33) 
which in a SISO system would imply that the sensitivity never exceeds 1 


(since F,(s) is also the return difference at the plant output in a SISO system). 
We also have that 


o{H,(jo)[I +H. (jo) +} = {1 — F7 (jo) (5.34) 
<1+6{F7t(jo)} (5.35) 
<2 (5.36) 


since (5.33) implies that {F7 '(j@)} <1. Applying the small gain theorem 
(Chapter 3), we see from (5.36) that the full-state optimal feedback system is 
robust against any unstructured multiplicative perturbation A, at the input of 
the plant, for which 


G(A(jo)) <3 (5.37) 


For single-input plants these results can be given striking graphical 
interpretations in terms of the Nyquist locus, as first shown by Kalman 
(1964). The inequality (5.33) becomes |1 + H,(ja)| > 1, which means that the 
Nyquist locus stays outside the circle with centre — 1, radius 1, as shown in 
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Figure 5.4 Nyquist locus at input of optimally regulated single-input plant (with 
full state feedback). 


Figure 5.4. The inequality (5.36) becomes 


H,(jo) 
ees ca) (5.38) 


which states that the Nyquist locus remains outside the M = 2 circle, namely 
the circle with centre — 4/3 and radius 2/3. It is seen from Figure 5.4 that, for 
a single-input plant, the results obtained by Safonov and Athans (1977) are a 
simple consequence of (5.33). 

Figure 5.4 also shows that the full state-feedback controller can 
achieve its impressive properties only because it behaves like a first-order 
system at high frequencies. Above the 0 dB cross-over frequency the phase is 
close to —90°, and hence the gain ‘roll-off’ rate is at most —20dB/decade. 
The same is true when there are several inputs: for some wg and e, 


a(H.(jo)) > —, W> Wo (5.39) 


E 
w 

A remark is in order about the assumption (5.31). This is not as 
restrictive as it appears, because we can apply (possibly non-diagonal) scaling 
to our plant, replacing B by BR‘, and equivalently replacing R (in the cost 
function (5.5)) by I. This procedure is particularly appropriate if it is known 
that the model uncertainties in some input directions are significantly differ- 
ent from those in other directions. 

Since equations (5.13) and (5.14) are dual to (5.11) and (5.12), respect- 
ively, it is apparent that we can obtain a similar result to (5.29) for the return 
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difference of the Kalman filter. The return ratio of the filter, evaluated at the 
input to the gain matrix K; (see Figure 5.1), is 


— H,(s) = —C(sI — A) t Ke (5.40) 
and the corresponding return difference is 

F,(s) =1+C(sI — A) 'K; (5.41) 
Proceeding as before, we obtain 

F(s) VF}? (—s) = V+ G,(s) WG? (—s) (5.42) 


where G,(s) = C(sI — A)~1T, and we conclude that the Kalman filter, viewed 
as a feedback system, has good robustness and performance properties. 


5.4 Loop transfer recovery (LTR) 


Since both the optimal state-feedback regulator and the Kalman filter have 
such good properties, it might be expected that LQG compensators would 
generally yield good robustness and performance. Unfortunately this is not 
the case: Doyle (1978) has shown that LQG designs can exhibit arbitrarily 
poor stability margins. The usual advice given on state-feedback/observer 
design is to make the observer dynamics much faster than the desired state- 
feedback dynamics. One might therefore expect that, if the eigenvalues of 
A — K,C were pushed sufficiently far into the left half-plane, then one would 
obtain nice state-feedback properties at the plant input. This also turns out to 
be false, as shown by Doyle and Stein (1979). They demonstrate that in some 
cases stability margins are actually reduced as the observer (filter) dynamics 
are made faster. 

Fortunately there is a way of designing the Kalman filter so that the 
full state-feedback properties are ‘recovered’ at the input of the plant. What is 
needed is for some of the filter’s eigenvalues to be placed at the zeros of the 
plant, the remainder being allowed to become arbitrarily fast; we shall outline 
a procedure for achieving this, taken from K wakernaak (1969) and Doyle and 
Stein (1979, 1981). Since the procedure relies on the ‘cancellation’ of some of 
the plant dynamics (in particular the zeros) by the filter dynamics, it is 
guaranteed to work only with minimum-phase plants. If right half-plane 
zeros exist in the plant the procedure may still work, particularly if these 
zeros lie beyond the operating bandwidth of the system as finally designed. 

The robustness and performance properties at the input of the plant 
are determined by the return ratio at the point marked 1 in Figure 5.2, 
whereas the return ratio (5.23), which is the one we would like to have, is the 
return ratio at the point marked 2. (It may seem surprising that the return 


232 MULTIVARIABLE DESIGN: LQG METHODS 


ratio at point 2 is as simple as — K,(s! — A)” t B. The reason is that the error 
dynamics of the filter are not excited by the control input u — in fact they are 
uncontrollable from u. If the only signal acting on the system is u — the loop 
being broken at point 1 — and if the matrices A, B and C in the filter match 
those of the plant exactly, then the signal at the input to the gain matrix Kç is 
zero.) In general, the return ratios at points 1 and 2 are quite different. The 
loop transfer recovery or LTR procedure which we shall now describe forces 
the return ratio at point 1 to approach that at point 2. 

Let the transfer function of the LQG compensator shown in 
Figure 5.2 be K(s). The return ratio at point 1 is then 


K(s)G(s) = —K,(sI- A +BK,+K,C)"'K,C(sI—A)"'B (5.43) 
If we let 

(s) = (sf — A) ' (5.44) 
and 

W(s) = (sI — A+ BK,)7! 


then (5.43) becomes (writing ¢ instead of ¢(s), etc.) 


KG = —K,[¥-'+K,C]"1K,;CoB (5.45) 
= —K,[¥—WK,(C¥K,+1)1C¥]K,CoB (5.46) 
= —K,¥K,[I—(C¥K,+ 1) 1C¥K,]CoB (5.47) 
= ~K, ,¥ K;[I + C#K;] CoB (5.48) 


In (5.46) and (5.48) we have used the matrix-inversion lemma (see Chapter 3, 
Note 3): 


(A+ BCD)! = A`! — AT! B(DA~!B+ C7!)"!DA™! 


Now suppose that we obtain K; by choosing the covariance matrix W 
(in equations (5.3)) as 


W = Wo + qÈ} (5.49) 
where È = È" > 0, and q is a real, positive parameter. Here Wo could be an 


estimate of the true process-noise (w) covariance, for example; in order to 
obtain LTR we shall need to increase q, in theory to arbitrarily large values. 
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Substituting for W in (5.14), we obtain 


P,Av AP, PCTV CP, TWoIt 
pet + 
q q q 


+TEIT=0 (5.50) 


Now, it can be shown (Kwakernaak and Sivan, 1972, p. 307) that, if 
C(sI — A) +T W+’? has no transmission zeros in the right half-plane, and if it 
has at least as many outputs as rank (Z), then 


P, 
lim == 0 (5.51) 


4>% 4 


This result may be thought of as meaning that, as q is increased, so the 
Kalman filter is being ‘told’ that an increasing proportion of the variance in 
the plant output is due to state variations, and a decreasing proportion to 
measurement errors. In these circumstances the filter is able to track the state 
trajectory relatively well: P, is (the filter’s estimate of) the covariance of the 
state estimation error, and (5.51) states that this is increasing more slowly 
than the process-noise covariance (if the stated assumptions hold). 
From (5.50) and (5.51) we have 


lim (qT EIT} = PCT y"? (5.52) 
qr œ 
But 
K; = Pc Vv (5.53) 
so 
K; >q PTE"? V" as qo (5.54) 


In particular, if we choose 

T = B, a | (5.55) 
and provided C(sI — A)~'B has no zeros in the right half-plane, then 

K; >q? BVT! asq>ou (5.56) 
Substituting this back in (5.48) gives | 


KG > ~q"? K, PBV P[I +q" C¥BV-1?]-!ChB asq>o 
(5.57) 


—> —K,YBV "2 [C¥#BV- 2] CoB (5.58) 


4>% 
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provided C'P B is square — that is, if the plant has as many inputs as outputs. 
Maintaining this assumption, and noting that 


Y= ¢o[1+ BK,ġ] t (5.59) 
we obtain 

lim KG = —K,¥B(CYB)"'CdB (5.60) 

q> © 


Il 


—K,¢[1+BK,¢] | B(C¢[I + BK,¢] +B) +CoB 


(5.61) 
= —K,oB[I+K,¢B] ‘(CéB[1+K,¢B]"')'CoB 

(5.62) 
= —K.oB (5.63) 


which is the return ratio at the point 2 in Figure 5.2, as required. 
The eigenvalues of the resulting Kalman filter are the zeros of 


det (sI — A+ K,C) = det(sl-—A+q'?BV'?C) (5.64) 


= det [(sI — A)(I, + q1? (sI — A)" t BV +? C)] 
(5.65) 


= det (sI — A) det (Ip + q! C(sI — A)" + BV +°) 
(5.66) 


where n is the state dimension of the plant, and m is the number of inputs or 


outputs. 
If |s| remains bounded, then 


det (sI — A + K,C) — q"? det (sI — A) det [G(s) V7] (5.67) 


q> œ% 


= q™!? p(s) det [G(s)] det (V712) (5.68) 


where p(s) is the pole polynomial of the plant. But p(s)det [G(s)] is just the 
zero polynomial of the plant, so it is seen that some of the filter’s eigenvalues 
remain bounded, and are in fact ‘attracted’ to the plant’s zeros as q is 
increased. The remaining eigenvalues become unbounded as q increases 
indefinitely. 
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5.5 Design procedure for square plant 


The result derived above leads to the following two-step approach to design. 


Step 1. Do the design on the deterministic Linear Quadratic prob- 
lem alone, manipulating the weighting matrices Q and R until a return 
ratio — K,(sI — A)~' B is obtained which would be satisfactory at the 
plant input. The emphasis of the design here is on aspects such as 
obtaining appropriate 0dB cross-over frequencies for the principal 
gains, possibly ‘balancing’ the principal gains, and adjusting the low- 
frequency behaviour. Robustness properties are obtained auto- 
matically. Of course, when analysing a trial design one may prefer to 
examine the principal gains of closed-loop quantities such as the 
return difference I + K,(sI — A)~!B, or its inverse. 


Step 2. Synthesize (‘design’ seems inappropriate here, since the pro- 
cedure is virtually automatic) a Kalman filter by setting T = B, 
W = W, +qI and V= I; alternatively, choose W = I and V = pl. 
Increase q (or reduce p) until the return ratio at the input of 
the compensated plant has converged sufficiently closely to 
—K,(j@I—A)~*B over a sufficiently large range of frequencies. 
Do not make q larger than is necessary, since this would force the gain 
‘roll-off’ rate to be only 20dB/decade at unnecessarily high fre- 
quencies, which would cause unnecessary noise amplification at these’ 
frequencies, and actually reduce the robustness of the design in the 
face of high-frequency unstructured perturbations. 


It is often desired to design the return ratio at the output of the plant 
rather than the input; in fact for design of the sensitivity function and of 
performance this is more appropriate. In this case one can follow a procedure 
which is dual to that described above: 


Step 1. Design a Kalman filter by manipulating the covariance 
matrices W and V until a return ratio — C(sI — A)~' Kẹ is obtained 
which would be satisfactory at the plant output. 


Step 2. Synthesize an optimal state-feedback regulator by setting 
M =C, Q = Qo + qI and R = I (or Q = I and R = p!), and increase q 
(reduce p) until the return ratio at the output of the compensated 
plant has converged sufficiently closely to — C(jwI — A)~'K, overa 
sufficiently large range of frequencies. 


5.6 Shaping the principal gains 


In order to exploit the LTR technique, we must know how to modify Q and R 
in order to bring about desirable changes in K,(s! — A)"'B or, if we are 
using the dual procedure, how to modify W and V in order to bring about 
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Figure 5.5 Generation and optimal filtering of y (for u=0). 


desirable changes in C(s] — A)" ‘Ky. To obtain an intuitive grasp of this, 
consider the Kalman filter. 

Let us assume that u = 0, for simplicity. Since u acts equally on x and 
$, we do not lose anything essential by this simplification. Figure 5.5 shows 
the stochastic process y being generated, and being processed by the Kalman 
filter to produce the state estimate xX. The dashed-line rectangle shown in 
Figure 5.5 has the transfer function G,(s) which appears in equation (5.42). 
Now, since the process noise v is uncorrelated with either Cx or (past values 
of) CÀ, the statistical properties of the state estimate $ are unchanged if we 
move v into the feedback path of the Kalman filter, as shown in Figure 5.6. 
This now looks like a feedback system which is to track (in a sense) the 
‘reference input’ z, while rejecting the measurement errors v, and we are 
already familiar with the trade-off between these two conflicting objectives. 

Roughly speaking, if the power spectral density of z increases relative 
to that of v at any frequency, then the loop gain should also increase at that 
frequency, because more importance can be attached to tracking z than to 
suppressing v. So we expect to increase the bandwidth of the Kalman filter by 
increasing IT WT" relative to V, and to decrease the bandwidth by increasing 
V. Indeed, from equation (5.50) we see that if W is made very large (q large in 
(5.50)) then an increase in W by a factor « leads to an increase in P+, and hence 
in Kp, by a factor \/«. 

Such adjustments are useful for ‘speeding up’ or ‘slowing down’ the 
Kalman filter uniformly in all signal directions. But they do not help if the 
shapes of the principal-gain plots need to be changed. To do this we can do 
one of two things: modify the matrices [WIT and V in a more sophisticated 
way, or modify the plant model by augmenting it with additional dynamics. 
We shall now examine these possibilities. 

One common requirement is to make the principal gains of the return 
ratio similar to each other in the vicinity of the required 0dB cross-over 
frequency. This is appropriate if the plant has potentially uniform dynamics 
(with realistic control signals), or if the unstructured uncertainty is ‘uniform’ 
(that is, similar in all output directions). 

We can use TWTT to increase the smallest principal gain of the 
sensitivity matrix, or decrease the largest one near some particular frequency. 
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Figure 5.6 Statistically equivalent model to that shown in Figure 5.5. 


Suppose we choose V = I. Then, from (5.42), we have 
F,(jo) Ff (jo) = I + G (jo) WGP (jo) (5.69) 


from which it follows that 


oLF(jo)] = {1 + o? EG (jo) WP? (5.70) 
Hence 

LF, (jo)] = {14+ 0° LG (jo) Ww] (5.71) 
and 

o[ Fe *(jo)] = {1+ 6° [G,(ja) WP 7? (5.72) 


so we can reduce a[F; '(jw)] by increasing o[G,(jw) W12], etc. But the 
point is not merely to reduce all the principal gains of FF ' (jæ), but to reduce 
the largest one, relative to the smallest. One way of doing this is first to obtain 
the singular-value decomposition of G;(j@)W' at the frequency at which 
the adjustment is to be made (œ, say) 


G,(jo,)W? = UZV" (5.73) 
= ¥ u,v! (5.74) 
i=1 


where {u;} and {v;} are the input and output principal directions, respect- 
ively. 


Now replace W"? by W"? (I + avv"). Then we have 


G;(jo,)W*? (1 + avot) = ¥ uo + (1+ a)ojujv (5.75) 


ixj 
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Figure 5.7 Generation and filtering of y when w is not white. 


so that the jth principal gain has been changed by the factor 1+. (at 
frequency w,), while all the other principal gains have been left unchanged. 
So to reduce o choose j = 1, and « < 0; to increase g choose j = m, and « > 0. 
The problem with this approach is that v; is usually a complex vector, 
whereas we wish to keep W+? real. Once again we are faced with the problem 
of approximating a complex matrix by a real matrix, and as before we can 
employ the ALIGN algorithm to solve it. In this case other algorithms may 
be more appropriate, however, since we really want to approximate v; rather 
than ‘align’ with it. In particular, Re {v,} is sometimes an adequate approxi- 
mation. A further possibility is to approximate v; by the left singular vector of 
the matrix [Re {v;} Im {v,}] which corresponds to its largest singular value.” 
It is not usually sufficient to modify the matrices W and V. The gains 
in particular frequency regions can be manipulated by altering the power 
spectral density matrices of z and v (see Figures 5.5 and 5.6) in those regions, 
and this can be achieved in practice by augmenting the plant model with 
additional dynamics. 
Let us recall equation (5.42), 


F,(s) VF{(—s) = V + G,(s) WG; (~s) 


and note that the expression on the right-hand side of this equation is just the 
power spectral density matrix of y, the measured plant output. If the process 
noise w is not white, but has a power spectral density W(s), then (5.42) 
generalizes to 


| F(s) VF (—s) = V + G(s) Wi(s), Gp (—s) (5.76) 


that is, the expression on the right remains the power spectral density of the 
measured plant output. In this case we can replace Figure 5.5 by Figure 5.7, 
which shows w being generated as the output of a transfer function H(s) 
driven by a white process v, where 


H(s)H™(—s) = W(s) (5.77) 
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and E{vy"} = I. This is no different, in essence, from the situation depicted in 
Figure 5.5, with C(sI — A)~‘T replaced by C(sI — A)” +T H(s). In order to 
perform this replacement in practice, we need a state-space model of H (s): 


= Awt + Buv, w= Cui + Duy (5.78) 


If we now use (Ap, Bp, Cp) to denote the state-space model of the plant, then, 
recalling that 


x= Appx + Bu + Iw, y=C,x+0 (5.79) 


we obtain 
Hew ve Welle bel eb 5.80) 
y=[C, of Fe 


These equations have the form of (5.1) and (5.2), and so fit into the LQG 
framework. 

One common application of this augmentation is to force ‘integral 
action’ into the LQG compensator. The purpose of integral action is to 
obtain zero sensitivity at œ = 0, by obtaining infinite open-loop gains at 
@=0. Our previous considerations lead us to expect that this will be 
achieved if we give the w process an unbounded power spectral density at 0 — 
that is, if we place eigenvalues of A,, at the origin. More formally, we require 
that 


a [F;(0)] = œ (5.81) 
which implies (because of (5.76) and (5.77)) that 
o[G,(0) H(0)] = œ% (5.82) 


which is achieved by placing poles of H(s) at the origin, unless G,(s) already 
has them there. 

In practice, one would usually start by introducing the augmented 
model (5.80) at an early stage of the design, at a later stage manipulating the 
covariance matrices to balance the gains and adjust the cross-over frequency, 
etc. The augmented model is carried through the various stages of the 
computations. 

There is a minor complication which arises during the recovery step, 
however. It is seen from (5.80) that the dynamics of ¢ are not controllable 
from the plant input. So if we place eigerivalues of A,, at the origin, we violate 
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Figure 5.8 Feedback loop with plant output disturbances and measurement noise. 


the assumption of asymptotic stabilizability which is required when solving 
the Riccati equation (5.12) at the recovery stage. Problems with the solution 
algorithm can be avoided by placing the eigenvalues of A,, not exactly at the 
origin, but just to the left of it, the distance being much smaller than the 
required bandwidth.* 

To boost the gain over a small range of frequencies we can introduce 
some lightly damped eigenvalues of A, (resonant poles). To produce a 
‘notch’, namely a drop in the gain over a small frequency range, we can either 
introduce zeros into the spectral density of w (by suitable choice of B, and 
C,,) or, more easily, augment the model in such a way that the spectral 
density of the measurement noise v is increased over that frequency range. 
The reader should be able to work out the details of this, on the basis of the 
material we have already presented. 

In this section we have been assuming that the dual of the LTR 
procedure described in Section 5.4 is followed, namely that the Kalman 
filter’s return ratio is manipulated until a satisfactory design has been 
obtained, and that it is then ‘recovered’ at the plant output by synthesizing a 
state-feedback controller automatically. Of course, everything we have said in 
this section is directly applicable to LTR at the plant input, if the roles of W 
and Q, and of V and R, are interchanged. In particular, since we have used 
augmentation of the plant model to replace the constant covariances W and 
V by power spectral densities W(s) and V(s), we can use similar augmen- 
tation to replace the constant weighting matrices Q and R by frequency- 
dependent weights Q(s) and R(s). 

Now let us return to the general LQG problem, without assuming 
that the LTR procedure is to be used. Suppose that we have plant disturb- 
ances d instead of state disturbances w, as in Figure 5.8, and that these 
disturbances have power spectral density D(s). Define z to be the plant output 
before corruption by measurement noise, as in Figure 5.8. The measurement 
noise v need not be white, and we can assume d and v to be generated by the 
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processes 
= Aat + Bay, d=Cgl (5.83) 
H=Ant+By v=Cyyt+@ (5.84) 


where {6}, {u} and {v} are white-noise processes. 
Combining this with equations (5.1) and (5.2) we obtain 


X x 0 0 x B 0 0 
: v 
€¢}=| 0 A 0 cfelofusj e ofi] 
ý 0 A, n 0 0 B 
(5.85) 
x 
y=[ C Ca “Cy Sop ee (5.86) 
4 
and 
x 
z=[ C, Cy 0])] ¢ (5.87) 
n 


We have given equations (5.85)-(5.87) explicitly in order to confirm that we 
still have equations in the form of (5.1), (5.2) and (5.6), so that we are still 
within the LQG framework. Note that we require E {007} > 0, E{Ov"} =0 
and E {0u"} =0 in order to satisfy our earlier assumptions. We also require 
A, and A, to have all their eigenvalues in the open left-half plane, since ¢ and 
yn are uncontrollable from u. 

From (3.61) we obtain 


J = E{z'Qz+u' Ru} 
= x {tr [QZ(w)] + tr [RU (@)]} do (5.88) 


where Z(œ) and U(œ) denote the power spectral densities of z and u, 
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respectively. Furthermore, using (3.66), we can express this as 


1 oO 


ae 3 


[Zoto Talio) Deo 


+5 oa LO Talo) V (jo)] 


+99? [R"? Tuljo) D*? (jo)] 


+ dof [R*? T,,(jo) roy de (5.89) 
where 
2(s) = T,4(s)d(s) + T,,(s)v(s) (5.90) 
and 
u(s) = T,a(s)d(s) + T,,,(s)v(s) (5.91) 


From Figure 5.8 (or from Chapter 1) we have that 


T,a(s) = S(s) (5.92) 

T,,(s)= I — S(s) (5.93) 

Ta(s)= — K(s)F; *(s) (5.94) 

T,(8)= Tyas) (5.95) 
where 

S(s) = [1 + G(s) K(s)]~’ (5.96) 
and 

F,(s) = I + K(s)G(s) (5.97) 


This shows that the solution to the LQG problem which we have set 
up minimizes the principal gains of the sensitivity S(j@), of the closed-loop 
response I — S(jm) and of K(jw)F; ‘(jm), each of these being weighted by 
D(jo), V(j@), Q and R, as shown by (5.89). By further augmenting the plant 
dynamics (in addition to the augmentation shown in (5.85)) we can replace Q 
and R by frequency-sensitive weights Q(jw) and R(jq@). Safonov et al. (1981) 
advocate pursuing LQG design from this point of view, adjusting each of the 
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four ‘parameter’ matrices of the problem to shape the closed-loop principal 
gains as required. If the greatest singular value of I — S (jæ) is too large, for 
instance, indicating poor robustness properties, then V'/*(jw)Q'/? (jw) 
should be used to increase the weighting in the direction of the singular vector 
associated with the largest singular value (at the appropriate frequency). 

An explicit connection between this approach to LQG and the LTR 
approach has been pointed out by Stein and Athans (1987). We saw in 
Section 5.5 that, in order to recover a Kalman filter’s return ratio at the plant 
output, we set Q = Fand R = pI, and reduce p towards zero. In this case we 
get, from (5.89) 


: E : : 
lim Jizz = =Í Lio [Tajo D (jo)] 
p70 T i 


+ 7[ T,,(jo) V? (ja)]} do (5.98) 


and we see that the LTR procedure, applied at the plant output, trades off the 
sensitivity S(j@) against the closed-loop response I — S(ja), with a relative 
weighting W(jw@) = D*!?(jw)V~ +? (ja). (See Exercise 5.2 for the dual result.) 

Stein and Athans (1987) also develop the theory of ‘loop 
shaping’— namely, how to implement the weighting W(s)—in a more formal 
way than we have done here. 


5.7 Some practical considerations 


It is evident from equation (5.62) that a compensator obtained by the LTR 
procedure inverts the stable plant dynamics. (It cannot cancel right half-plane 
poles or zeros, since this would lead to an internally unstable system, and 
LQG compensators are known to guarantee the internal stability of the 
closed loop.) One should therefore be careful to specify a return ratio to be 
recovered which is as compatible as possible with the plant dynamics. If the 
plant has lightly damped resonant poles, while the recovered return ratio 
shows no trace of the resonance, then the compensator will inevitably contain 
zeros which cancel those poles. As discussed in Chapter 4, this may or may 
not be acceptable, depending on how accurately the resonant poles can be 
located. 

Trying to suppress the effects of lightly damped poles or zeros also 
leads to difficulties with the recovery procedure itself. The step from equation 
(5.62) to equation (5.63) is not valid at poles or zeros of the plant, or indeed of 
the return difference J] + K,@B, since some of the inverses will not exist at 
such points. Although the limiting process (5.60)—(5.63) is valid in any neigh- 
bourhood of a pole or a zero, the rate of convergence (with q) may be very 
slow. The practical consequence of this is that if one tries to suppress (or 
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insert) dynamics which correspond to poles or zeros located very near the 
imaginary axis, then one may need to increase the ‘recovery parameter’ q to 
extremely large values in order to get a good degree of approximation at 
frequencies close to those pole or zero locations. And this, in turn, leads to 
recovery of the 20 dB/decade ‘roll-off’ rate well beyond the loop bandwidth, 
and thus to unnecessarily low attenuation of measurement noise. This 
situation can be ameliorated, however, either by further dynamic augmen- 
tation of the plant model, or by some judicious low-pass filtering, applied in 
an ad hoc manner. 

If the plant to be controlled has unequal numbers of inputs and 
outputs, then the LTR procedure cannot be applied directly. It remains 
possible to adopt the viewpoint of Safonov et al. (1981), and to adjust the 
various ‘parameter’ matrices appearing in (5.89). It is also sometimes possible 
to apply the LTR procedure, after augmenting the plant with dummy inputs 
or outputs. If the plant has more outputs than inputs, it may be augmented 
with dummy inputs (taking care not to introduce any right half-plane zeros), 
and the return ratio K(s)G(s) at the plant input can be shaped by the LTR 
procedure. In this case (5.51) holds, so the ideal state-feedback return ratio 
can be recovered at the plant input. The dual of (5.51) does not hold, however, 
so recovery cannot be obtained at the plant output. If the plant has more 
inputs than outputs then it may be augmented with dummy outputs, and 
recovery can be obtained at the output, but not at the input. Further details 
of suitable augmentation procedures are given by Doyle and Stein (1981). 


5.8 Design example 


We shall now use the LQG/LTR approach to design a compensator for the 
AIRC aircraft model (see the Appendix). Initially, we shall adopt similar 
specifications to those used in Chapter 4: we aim for a bandwidth of about 
10rads~*, and integral action in each loop, with well-damped responses. 

We shall design the return ratio of a Kalman filter, and recover this at 
the output of the compensated plant. We shall judge trial designs entirely on 
the basis of principal-gain plots, since the stability margins of the Kalman 
filter are guaranteed to be good. 


5.8.1 Kalman-filter design 


We need to choose the matrices I, W and V which appear in (5.1) and (5.3), 
solve the corresponding Riccati equation (5.14), and obtain the Kalman-filter 
gain K, from (5.13). We shall write 


K; = LQE(4,T, C, W, V) (5.99) 
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to denote that K; is obtained from T, W and V in this way,’ with A and C 
taken from the plant model. It is generally advisable to start with simple 
choices of T, W and V, inspect the resulting Kalman filter return ratio, adjust 
T, W and V accordingly, and so gradually improve the return ratio. One of 
the simplest possible choices is F = B, W = I, and V = I}. The choice = B 
corresponds to disturbances on the plant acting through its inputs, rather 
than directly on the state, and leads to the covariance matrix W having the 
same dimension as the number of loops (if the plant is square), rather than the 
number of states. 
We have 


Ke, = LQE(A, B,C, I}, I3) (5.100) 


0.9897 0.0732 — 0.2507 
0.0732 0.9436 — 0.0642 
= — 0.2507 — 0.0642 1.7807 (5.101) 
— 0.4934 — 0.0485 1.6190 
— 0.8076 — 0.2483 22215 


The return ratio — C(sI — A)~'K,, has principal gains as shown in 
Figure 5.9. One of these shows a slope of — 20dB/decade at very low 
frequencies, because of the pole at the origin which this plant contains, but 
the other two show constant values of gain at low frequencies, of about 
16dB and 2 dB, respectively. The open-loop bandwidth is seen to be about 
lrads~', or rather less, in some signal directions. 

The first thing to do is to insert integral action, by augmenting the 
plant model. As we mentioned before, placing poles of the augmented model 
at the origin leads to problems in the recovery step later, so in this case we 
place them at — 0.001, which is virtually at the origin, when compared with 
the required bandwidth of 10rads~'. Choosing the simplest possible aug- 
mentation of the plant gives 


(5.102) 
Cy, = l, Dy = 03, 3 


As was the case with the characteristic-locus and the Nyquist-array design 
methods examined in Chapter 4, we have inserted three integrators rather 
than two, even though the plant itself contains an integrator. We could have 
introduced just two integrators, and chosen B, and C,, carefully, to ensure 
that integration was present for every signal direction, but this would have 
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made the procedure vulnerable to errors in plant modelling, which could 
result in integration being absent from some loop. 

Also, it is often advisable to have integral action at each compensator 
output, in order to overcome any friction which may be present in the 
actuators. 

We could also have chosen C,, more carefully, with the aim of 
adjusting the low-frequency gains. l 

The augmented model is, using equation (5.80), 


rD 
C=C 0], D,=03,3, D= | "| (5.103) 


Now we have 


Ke = LQE (4,, T., Ca, I3, I3) (5.104) 
1.4129 0.0909 — 0.1648 
0.0909 1.3501 — 0.0539 
— 0.1648 —0.0539 2.0429 
—0.4556 0.0062 2.1018 
= (5.105) 
— 1.2024 — 0.2823 2.4314 
— 0.6484 — 0.0329 0.7587 
0.0225 0.9964 0.0624 


|_— 0.7573 0.0576 — 0.6463 


The principal gains of the return ratio — C,(sf — Aa) ‘Kg, are shown in 
Figure 5.9. Integral action is clearly visible in each signal direction, and the 
gains at 0.001 rad s~ * have all been increased by 60 dB, as would be expected 
for integrators of the form 1/s. Note that at frequencies above trads~’, 
where the gain of the augmenting system (Aw, Bw, Cw, Dw) is comparable with 
that of the plant, the gains are almost unchanged. 

Next we shall attempt to increase the smallest principal gain at low 
frequencies (œ < 0.1 rads™+, say) in order to speed up the elimination of 
steady-state errors. Increasing it by a factor of 10 (20 dB) would make the two 
smaller principal gains similar at low frequencies, and we shall aim to do this. 
We use (5.71) but note that, since [Fp '(j0.001)] ~ — 55dB « OdB, we 


can replace (5.71) by 


o[C,(j0.001 — 44)” +K] x o[C,(j0.0011— A,)-1T,] (5.106) 
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Figure 5.9 Open-loop principal gains of initial Kalman filter, with filter gains K; 
(dash-dotted curves), K;, (dashed curves) and K;, (solid curves). 


In other words we can, as usual, work with open-loop gains instead of closed- 
loop gains, if the loop gains are large (or small) enough. The singular-value 
decomposition (5.73) is, in this case, 


C,(j0.0011 — 4) tT, = UZV# (5.107) 


which gives 


0.509 
vs = | 0.158 + 0.0005; (5.108) 
0.846 — 0.0005] 


as the third column of V. 


In this case Re {v3} is a very good approximation to v3, so, setting 
a = 9 (since 1 + « = 10, see (5.75)), we have 


Wi? = 1, + 9Re{v,} Re {v3} (5.109) 
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Now we set W; = W3?(W4/7)' and obtain the corresponding Kalman- 
filter gain as 


K;3 = LQE(A,, | Bee C,, W,,1;) (5.110) 


2.7724 —0.3719 —0.5012 

— 0.3719 1.5375 0.1460 

— 0.5012 0.1460 2.5049 

— 1.1123 0.5385 3.2734 

= (5.111) 
—46051 1.2121 4.4222 

— 3.9426 1.6448 2.8887 

— 0.8835 1.4814 0.7096 


— 7.2415 3.1669 3.0434 | 


Figure 5.9 shows the principal gains of —C,(sI — 4,) 'Kş3. The previous 
smallest principal gain is seen to have been increased by almost exactly 20 dB 
at low frequencies, while the other two have remained unchanged. The 
smallest principal gain has been increased significantly at all frequencies 
below 1 rads~', the increase at 0.1 rads~* being about 17dB, and has 
remained almost unchanged at higher frequencies. At the same time, the 
largest principal gain has been increased by about 4dB at high frequencies. 

Figure 5.9 indicates that the 0 dB cross-over frequency of the compen- 
sated system is between 1.3 and 4rads~*, whereas we would like it to be 
about 7rads~* (= 10/,/2, approximately) or higher in order to obtain a 
bandwidth of about 10 rads~+. If all the gains were increased by 10 dB (that 
is, by a factor of ,/10) at all frequencies, then the largest principal gain would 
have a 0dB cross-over frequency very close to 10 rads" +. Increasing W, by a 
factor of 10 gives a 10dB increase at low frequencies, but only about 5dB 
near 10rads~', and it turns out that an increase by a factor of 100 is needed 
to obtain a 10dB increase near 10rads~*. Figure 5.10 shows the largest 
principal gains obtained using W,, 10 W, and 100 W3. 

Now let 


Kreg = LQE(A,,I,, C, 100 W3, 15) (5.112) 


and let us examine the closed-loop properties which would result if we 
recovered the return ratio — C,(sI — Aa) ‘Ky, at the plant output. Figure 
5.11 shows the principal gains of both the sensitivity 


Sels) = [I + C,(sI — Aa) * Kea]? (5.113) 
and the closed-loop transfer function 


T,(s) = I — S,(s) (5.114) 


Gain (dB) 


Gain (dB) 


160 


DESIGN EXAMPLE 


TT Tri ae Cie Ban rT TTR 


140 Poe 


107? 107! 10° 10' 


Frequency (rads~!) 


10? 


Figure 5.10 Open-loop principal gains with W3, 10W; and 100 W3. 
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Figure 5.11 Principal gains of & and T; using K;,. 
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This shows that the bandwidth specification has been approximately satis- 
fied, the — 3dB frequency of 7;(s) being between about 6.5 and 12rads~', 
and that of S;(s) being between about 2.5 and 5.5 rads~ +. Reasonably good 
stability margins are of course guaranteed, and we can see from Figure 5.11 
that || 7;||,, = 1.6 (4dB), approximately. We could therefore terminate the 
Kalman-filter design at this point, and move on to the recovery step. 
However, in order to demonstrate that quite fine control can be exercised 
over the details of closed-loop performance, we shall suppose that we wish to 
improve the sensitivity further. 

Specifically, we shall try to make all the principal gains of the 
sensitivity S; behave similarly as they approach 0dB, by giving them all a 
value of 1/,/2 (— 3dB) at 5.5rads~*. To do this, we need to obtain the 
singular-value decomposition (see (5.73)) 


C,(j5.51 — A,)71T,(100 W4)"? = UEV” (5.115) 


which gives 


xX = diag {0.935, 0.379, 0.119} (5.116) 
and 
0.723 — 0.481 0.496 
V =| 0.300 — 0.0305; 0.859 — 0.110) 0.395 — 0.0621; 
0.621 — 0.0368} 0.140 + 0.0026; — 0.769 + 0.0562) _ 
(5.117) 


(W12? is available from (5.109). To obtain the square root of W in general 
requires an algorithm such as Cholesky decomposition to be available — see 
Golub and Van Loan (1983).) We proceed by using equations (5.70) and 
(5.75), as we did earlier, but now we cannot work with open-loop approxi- 
mations since the loop gains are close to unity. The principal gains of the 
Kalman filter’s return difference, with the current design, are 


{aiL Fe(j5.5)]} = {1.369, 1.069, 1.007} (5.118) 


and we wish to force these to have the values {,/2,,/2,,/2}. From equations 
(5.70) and (5.75) we find that this requires a, = 0.069, a, = 1.639 and 
a, = 7.409. Again, V is given to a good approximation by its real part, so we 
set 

Wi? = 10W12 (I + a, Re {vvt} (U + a, Re {u,v} }) 


x (I + a3 Re {v3v$}) (5.119) 
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Figure 5.12 Principal gains of S, and 7; using Kes. 


Then W; = W2/*(W??)", and 


K;s = LQE (4,, T., C,, Ws, 13) (5.120) 
9.7235 — 0.2568 0.5814 
— 0.2568 7.2970  — 0.1420 
0.5814 — 0.1420 9.5072 
6.5138 — 1.2568 45,3725 
= (5.121) 
> 46.8171 1.9765 6.0590 
— 101.7203 7.3047 — 11.6414 
— 11.1370 27.7714  — 1.4828 


{ — 265.8522 22.2337 — 115.6172 


Figure 5.12 shows the principal gains of the sensitivity S; and the 
closed-loop transfer function 7; of the new design, and it can be seen that 
the principal gains have indeed been brought very close to each other in the 
neighbourhood of 5.5rads~*. This has been achieved at the expense of a 
larger bandwidth of 7;, but with very little increase in || 7;||,,, and hence very 
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Figure 5.13 Open-loop principal gains after balancing at 5.5 rad s 


little deterioration of stability margins. In fact, some benefit has been ob- 
tained from the fact that measurement noise is now being amplified 
(o;(7;) > 1) over a smaller range of frequencies than before. As expected, the 
system looks more uniform in all signal directions; for example, the range 
of —3dB frequencies of 7; has been reduced from 6.4 < œ < 12.2 to 
10.7 < œ < 15.0. 

Figure 5.13 shows the principal gains of the open-loop return ratio 
—C,(sI — A,)~ 1 Krys; all the gains have clearly been ‘squeezed together’ near 
10 rads~', and at higher frequencies. Figure 5.14 shows the characteristic loci 
of this return ratio, together with the unit circle centred on — 1. All the loci 
remain outside or on the boundary of this circle, as predicted by Kalman- 
filter theory, from which (and from Figure 5.12) we know that o,(S,;) < 1, and 
a necessary condition for this to be true is that the characteristic loci do not 
penetrate into the circle. 


5.8.2 Recovery at the plant output 


Having obtained a satisfactory return ratio for the Kalman filter, we now 
recover this return ratio at the output of the plant. The plant has no 
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Figure 5.14 Characteristic loci of Kalman filter's return ratio. 


transmission zeros in the right half-plane (in fact it has no finite zeros at all), 
so arbitrarily good recovery should be possible. 

To obtain LTR, we solve the Riccati equation (5.12) with M = C,, 
Q = I and R = pl, and obtain the corresponding state-feedback matrix K, 
from (5.11), using B, in place of B. We shall write 


K, = LQR(A,, B,, CT C,, p13) (5.122) 


to denote that K, has been obtained in this way.” Perfect recovery should be 
obtained in the limit, as p is reduced to zero. 
Once K, has been found, a state-space realization of the compensator 
K(s) is given by (KA, KB, KC, KD), where 


KA = A, — B,K. — K;5C, (5.123) 


KB = Kys, KC =— —K,, KD z= 03, 3 


(see Figure 5.2). Figure 5.15 shows the principal gains of G(s)K (s), together 
with those of C (sI — Aa) Kes, for p = 1077, 1074, 1076 and 1078. The 
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Figure 5.15 Loop transfer recovery for (a) p= 107”, (b) p= 1074, (c) p=107 6 and 
(d) p=107°. ! 
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Figure 5.15 (cont.) 
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Figure 5.16 Characteristic loci at output of compensated plant, for p=10°° 
(dashed curves) and p= 107 § (solid curves). 


value p = 107° appears to give very good recovery, and one might think that 
there is no need to reduce p further. However, Figure 5.16 shows the 
characteristic loci of G(s)K(s) for p = 1076 and 107°, and it is seen that the 
increased ‘roll-off’ rate which is visible in Figure 5.15(c) reduces stability 
margins considerably. Figure 5.17 shows the principal gains of the sensitivity 
S(s) and of the closed-loop transfer function T(s) for the two cases. 
Figure 5.17(a), for p= 107%, shows lower high-frequency gain than 
Figure 5.17(b), but significantly reduced stability margins and increased 
sensitivity. We shall therefore adopt the compensator with p = 1078. The 
corresponding optimal state-feedback matrix is 


K, = [Keis I] 
where 
— 5960 — 1158 7853 36.3 82.0 
K. = — 700 9933 910 4.3 9.8 (5.124) 


— 8000 —7.8  —6205 —794 180 | 
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Figure 5.17 Principal gains of S and Twith (a) p=10~° and (b) p=1078. 
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Figure 5.18 Closed-loop step responses to step demand on output 1 (solid 
curves), output 2 (dashed curves) and output 3 (dotted curves). 


When implementing the compensator, A, in (5.123) should be replaced 
by 


5 A TC 
A= F 0 | (5.125) 
(and not by A, as given by (5.103)) in order to obtain true integration. 
With this change, the closed-loop responses which result from step demands 
being applied to each of the three outputs are as shown in Figure 5.18. 
The closed-loop poles are located at — 3.661 + 3.668j, — 2.600 + 4.369), 
— 2.606 + 4.676), — 5.195 + 0.154j, — 48.14 + 48.14j, — 156.9 + 156.9} and 
— 10072, which shows that the compensator does not attempt to cancel the 
plant’s resonant poles (which lie at — 0.018 + 0.183)). 


5.8.3 Comparison with previous designs 


When using the LQG/LTR technique, the design process is quite different to 
that required with the characteristic-locus or Nyquist-array methods. There 
is'no reduction of the multivariable problem to a set of SISO problems, and 
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the fact that stability and stability margins are guaranteed leads to an absence 
of explicit concern with phase compensation. 

We have not attempted to reduce interaction, as we did in the 
designs examined in Chapter 4. In the present case, Figure 5.18 shows that 
interaction has been removed, even without having taken explicit steps to 
do so. 

The state dimension of the LQG compensator is the same as that of 
the augmented plant model, and hence is 8 for our design, which is larger than 
that required for either of the two design examples presented in Chapter 4. It 
may be possible to find a lower-order approximation to this compensator, 
which has a very similar frequency response, using the model-reduction 
algorithm discussed in Section 8.3.9. 


5.9 Non-minimum-phase plant 


The LTR procedure can be applied to unstable plant without difficulty, since 
the unstable poles are shifted into the left half-plane by the feedback, and 
no unstable pole—zero cancellations are implied. But recovery cannot be 
achieved for plants with right half-plane zeros, as already mentioned. 
There are two ways of viewing the reason for this. The first is that recovery 
of an arbitrary return ratio would in general require unstable pole—zero 
cancellations to be introduced by the compensator, and no LQG compen- 
sator will do this. The second is to note that the state-feedback return ratio 
— K,(sf — A)~'Band the Kalman-filter return ratio — C(sI — A)~'K, both 
have all their zeros in the left half-plane. If this were not so, these return ratios 
would not have infinite gain margins. But any return ratio at the plant input 
or output which does not involve unstable pole—-zero cancellations must 
have the right half-plane zeros of the plant, and therefore cannot be 
forced to approach either the state-feedback or the Kalman-filter return 
ratio. 

This second point of view suggests that recovery may be achievable at 
those frequencies at which the plant’s response is very close to that of a 
minimum-phase plant -that is, at frequencies which are small compared with 
the distance from the origin to any right half-plane zero. This is indeed the 
case, so the simplest strategy to use with non-minimum-phase plants is to 
follow the usual LTR procedure and hope for the best. This will certainly give 
closed-loop stability, it will still give an optimal trade-off between S(jœ) and 
I — S(jq) if LTR is applied at the plant output, and it will give some degree of 
recovery of the ideal return ratio. If the right half-plane zeros lie well outside 
the required bandwidth, then adequate recovery of the ideal characteristics 
should be achieved at all significant frequencies. If any of them lie within the 
required bandwidth, then of course we can expect problems, whatever design 
technique we choose. 
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The explanation of the success of the approach described above lies in 
the theory of optimal root loci (Kwakernaak and Sivan, 1972). This shows 
that, as the LTR ‘recovery parameter’ q (or p) is made large (or small, 
respectively), each left half-plane zero of the plant ‘attracts’ one of the 
compensator poles. (The mirror image, about the imaginary axis, of 
each right half-plane zero, also ‘attracts’ one compensator pole.) All the 
‘minimum-phase dynamics’ of the plant are therefore cancelled by the 
LTR compensator. 

An alternative and more complicated strategy is to approximate the 
plant by one having only left half-plane zeros. If the plant is factored as 


G(s) = A(s)Go(s) (5.128) 


where G,(s) has only left half-plane zeros, and A(s) is a stable all-pass transfer 
function, namely one such that 


A(jo) A"(jo@) = 1 (5.129) 


then the application of the LTR procedure to Go(s) will result in perfect 
recovery of the open-loop gain characteristics at the input or output of the 
true plant G(s). Stability margins will be worse than predicted for Go(s), 
however, and indeed closed-loop stability will no longer be guaranteed. On 
the zero locations of A(s), and this can be allowed for when designing 
the return ratio which is to be recovered. Algorithms for obtaining state- 
space realizations of A(s) and Go(s) are discussed by Stein and Athans 
(1987). 

A third strategy for dealing with non-minimum-phase plant is to 
continue to use the LTR procedure, but to recover something other than the 
return ratio of an optimal state-feedback regulator or of a Kalman filter. If 
Section 5.4 is reviewed, it will be seen that recovery of the return ratio 
— K,(s! — A)~'B at the plant input does not depend on K, being an optimal 
state-feedback matrix. Any stabilizing state-feedback matrix, such as one 
obtained by some pole-placement algorithm, could be used and recovered 
just as well. Also, it can be shown (Stein and Athans, 1987) that any return 
ratio which shares the right half-plane pole and zero structure of the plant 
can be recovered by the LTR process. Thus the LQG/LTR approach can be 
replaced by a more general ‘state-feedback/LTR’ or ‘state-observer/LTR’ 
approach, which can be applied to non-minimum-phase plant. The usefulness 
of such an approach is questionable, however. The original motives for 
introducing the LQG/LTR technique were to allow the good robustness and 
performance properties of the linear quadratic regulator or the Kalman filter 
to be obtained, and to simplify the LQG design procedure. If both of these are 
now discarded, then LTR remains of interest only if some powerful methodol- 
ogy is brought in to replace the discarded half of LQG/LTR. 
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SUMMARY 


In this chapter we have introduced the linear quadratic Gaussian (LQG) 
control problem, and outlined its solution. We have then shown how the 
general LQG problem can be restricted (LQG/LTR) in such a way that it 
yields a useful and fairly straightforward method of feedback design. The 
use of LQG methods guarantees internal stability, so the designer can shift 
attention from ‘phase compensation’ to ‘gain shaping’ in order to achieve 
desirable performance and robustness properties. 

Adjusting the various covariance and cost-weighting matrices which 
appear in the LQG problem statement has been shown to be equivalent to 
adjusting the trade-off between the principal gains of various closed-loop 
transfer functions. In particular, the LQG/LTR procedure, applied at the 
output of the plant, has been shown to trade off the principal gains of the 
sensitivity S(s) against those of T(s) = J — S(s). Furthermore, this interpret- 
ation continues to hold for plants with right half-plane zeros, when loop 
transfer recovery cannot be guaranteed to work. 

In this chapter we have assumed that the plant can be described by a 
rational transfer-function matrix, or equivalently, by a state-space model of 
finite state dimension. 


Notes 


1. We emphasize that the separation principle holds only when all three assump- 
tions are satisfied: linear model, quadratic cost (J) and Gaussian stochastic 
processes (w and v). 


2. This was suggested by Matt Wette, a graduate student at the University of 
California, Santa Barbara. It can be shown that this approximation achieves 
the smallest possible ‘gap’ between v, and any real vector. (Gap’ here means 
the sine of the angle between two vectors -see Golub and Van Loan (1983).) 


3. The state estimate X is given in terms of the plant output (for u = 0) by 
X(s) = (sI — A)” *K,F 1 (s)y(s). Thus equation (5.76) shows that the filter 
action can always be interpreted as creating a white process Fç '(s)y(s) 
{the ‘innovation process’), and then applying this white process to 
{sI — A) 1K, to obtain the optimal estimate x. 


4. It is also possible to use the original plant model, without augmentation, 
during the recovery step. This gives approximate, but often sufficient, LTR. It 
is of course essential to use the full augmented model when implementing the 
compensator. 


5, The notation K; = LQE(A,I,C,W,V) and K, = LQR (A,B, MTQM, R) is 
consistent with that used in the Control System Toolbox software, which is 
available for use with PC-Matlab and Pro-Matlab (see Chapter 8). 
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EXERCISES 


5.1 (Alternative derivation of the LTR result.) Let u, be the signal at the 
input to the matrix B of the LQG compensator (see Figure 5.2), and let 
u be the signal at the plant input. Show that 


X(s) = $(s)[B{Co(s)B}~* — Ke {I + Cols)Ke} ']C G(s) Bu, 
— b()K; {I + Co(s)Ke}~' Co(s)Bu 


where ¢(s) = (sI — A)~*. Hence show that LTR is obtained at the 
plant input if 


K;{I + Co(s)K;}~* = B{C$(s)B}~* 


Show that this condition is satisfied asymptotically if K; > q! BX, as 
q > œ, for any non-singular X. 


5.2 In Section 5.6 it is shown that applying the LTR procedure at 
the plant output is equivalent to trading off the sensitivity S(jq@) 
against I — S(jw). Show that applying the procedure at the plant 
input is equivalent to trading off [I + K(j@)G(jw)]~/ against 
[I + K(joG(jo)]~*K(jo)G(jo). 

(Hint: Consider a disturbance acting at the input of the plant, and a 
weighting function W = R7 120126.) 


5.3 Repeat the LQG/LTR design performed on the AIRC model (see the 
Appendix) in Section 5.8, but aiming for a closed-loop bandwidth of 
about 3rads~‘!. Compare your design with the one obtained as the 
solution of Exercise 4.1. 


5,4 Use the LQG/LTR method to design a feedback compensator for the 
TGEN model (see the Appendix), assuming a control structure with 
one degree of freedom, and the same specifications as in Exercise 4.2. 


5.5 Use the LQG/LTR method to design a feedback compensator for the 
RPV model given in the Appendix. The specification is that ¢[S( jo)] 
is to be minimized, where S(j@) = [I+ G(jw)K(jo)]~* is the sensi- 
tivity, subject to the constraints 


gU — S(jw)] < 2, for all œ 
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and 
7 3 10 
cU — S(jœ)] < v œ > 100 


Examine the step responses of your design. 
(Note that the RPV model is open-loop unstable. You are not 
expected to obtain an exact solution to the minimization problem.) 


e 
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CHAPTER 6 
The Youla Para- 
metrization and H, 
Optimal Control 


6.1 Introduction 6.7 An algorithm for solving 
6.2 A motivating example: sensiti- general H,, problems 
vity minimization 68 Design example 
6.3 The H,, problem formulation 6.9 Review and comments 
64 The Youla (or Q) parametri- Summary 
zation Exercises 
6.5 Solution of the H,, problem References 
6.6 The Hankel approximation 
problem 


6.1 Introduction 


This chapter is primarily concerned with H (pronounced as ‘H-infinity’) 
optimal control. This relatively new approach to feedback design is ex- 
plained, and a complete derivation of a solution for a special case (the 
‘1-block’ problem) is given in Sections 6.4 to 6.6. A better algorithm, how- 
ever, and one which can be applied to general H « Problems, is given in 
Section 6.7 (without any derivation). Our derivation introduces the cele- 
brated ‘Youla parametrization’ of all stabilizing feedback controllers for a 
given plant, which we explore in some detail in Section 6.4. It also introduces 
the Hankel-approximation problem, in Section 6.6. This has important 
applications, particularly to model approximation. 
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External 

inputs Error 
Control j Measured 
signals variables 


Figure 6.1 Standard representation of inaccurately known plant under feedback 
control. 


In Chapter 3 we saw that one can obtain an accurate representation of 
the uncertainty inherent in a plant model in the form of Figure 6.1. Recall that 
in this representation ‘external inputs’ (w) is a vector of all the signals entering 
the system, and ‘error’ (z) is a vector of all the signals required to characterize 
the behaviour of the closed-loop system. Both these vectors may contain 
elements which are abstract in the sense that they may be defined math- 
ematically, but do not represent signals which actually exist at any point in 
the system. u is the vector of control signals, and y is the vector of measured 
outputs. 

In this chapter we shall use the representation shown in Figure 6.2, 
which is the same as Figure 6.1 but with the additive perturbation A(s) not 
shown. P(s) is derived from the nominal plant model, just as in Chapter 3. 
However, it may also include weighting functions which depend on the design 
problem that is being solved. 

Suppose that P(s) is partitioned as 


P,,(s) ad 
P(s)= (6.1) 
(9) P P32(s) i 
so that 
Z=Py,wt+Pi,u,  y=Pzw+P224 (6.2) 


Then we can eliminate u and y, using u= Ky, to obtain 
z=[Pi +P KI —P2K) Pa ]w (6.3) 


The expression enclosed in brackets in equation (6.3) will occur frequently in 
the rest of this chapter, so it will be convenient to have a shorthand notation 
for it. It is becoming conventional to denote it by F\(P, K), so that we can 


rewrite (6.3) as! 


z=F,(P,K)w (6.4) 
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Figure 6.2 As Figure 6.1, but with perturbation A removed. 


By suitably defining w and z (or, equivalently, P), it is possible to put a 
number of practical design problems into the form 


minimize || F\(P, K)|| ,, 


where the minimization is over all realizable controllers K(s) which stabilize 
the closed loop, and ||. ||, is as defined in Section 3.8. 

This is known as the H,,-optimization problem. In this chapter we 
shall show how this problem arises in a few examples, and how to solve it 
using state-space algorithms. 

On the way to the solution of the H „ problem we shall examine the 
structure of all stabilizing controllers for a given plant, and show that the 
tricks we had to use in Chapter 5 to obtain good results with LQG methods 
were in no way special to the use of those methods: all stabilizing controllers 
can be thought of as observer/state-feedback combinations. Furthermore, all 
the closed-loop properties which are usually of interest, such as the sensitivity 
function S(s), the closed-loop transfer function T(s) or, more generally, 
F,(P(s), K(s)), depend in a very simple manner on a single (albeit transfer- 
function matrix valued) parameter. These results are of fundamental import- 
ance in feedback theory, and their interest is not confined to the context of 
H „ design. 

In Chapter 3 we defined ‘H,,’ to be the set of transfer functions of 
asymptotically stable, realizable systems. In this chapter we shall often write 
XeH,, to mean that X is an asymptotically stable, realizable system. The 
term ‘H „ problem’ arises from the fact that we are minimizing || F\(P, K)]||,, 
over all F\(P, K), such that F,\(P, K)e H „and the feedback combination of P 
and K is internally stable. A consistent term for the LQG problem, which is 
sometimes used, is ‘H, problem’, since that requires the minimization of 


| Fi(P, K)||, over all F\(P, K)eH,, again with the constraint of internal 
stability. 


6.2 A motivating example: sensitivity minimization 


Let us begin by looking at a relatively simple example. Consider the familiar 
feedback loop shown in Figure 6.3. To make everything as simple as possible, 
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Figure 6.3 Feedback configuration for motivating example. 


we assume that all the signals are scalars, and that G is stable. Suppose that 
we do not know what disturbances to expect, but we do know that their 
spectrum is essentially confined to the frequency range 0<@<a@,. A plaus- 
ible design goal is to find a controller K which minimizes the worst-case 
excursion of y resulting from any disturbance d (and stabilizes the loop, of 
course), and this could be achieved by solving the problem 


minimize || S || ,, (6.5) 


or, equivalently, 
minimize sup|S(jq)| (6.6) 


where S=(1+GK)~' is the sensitivity function.” In fact this goal can be 
improved by limiting the minimization to the frequency range in which 
disturbances occur, since we know that making the sensitivity small outside 
this range will lead to unnecessary noise amplification and poor stability 
margins. We can do this by finding a (stable, minimum-phase) transfer 
function W(s) with the properties 

|W(jw)|~1, O<@<a, (6.7) 
and 


|W(jo)|«1, o>, (6.8) 


and then posing the problem 
minimize sup|W(j@)S(j@)| (6.9) 


We now make a crucial observation. Suppose that we define 


Q=K(I+GK)"* (6.10) 
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then we can rewrite S in terms of Q and G as 
S=I~GQ (6.11) 


Furthermore, it is easy to check that 


(1+KG) 'K=Q (6.12) 

(I+ KG)" '=I-—QG . (6.13) 
and 

(1+GK)"'G=(I—GQ)G (6.14) 


Now, the four matrices defined in (6.11)—(6.14) are just the matrices H i; which 
appeared in Definition 2.7. Recall that the closed-loop system is internally 
stable if and only if each of these is stable. Since Q appears in each of these 
expressions in a particularly simple manner, and since we have assumed G to 
be stable, we can see immediately that the closed loop will be stable whenever 
Q is stable, because then each of (6.11)-(6.14) contains simply a product of 
stable matrices. What is more, the closed loop will be stable only if Q is stable, 
for if Q were unstable then (6.12) would be unstable, and the whole system 
would be unstable. We can also see from (6.10) that K will be proper 
(realizable) only if Q is proper (because with K proper, Q-K as |s| 00), and 
from (6.15) below we shall be able to deduce that Q will be proper only if K is 
proper. Combining these observations with (6.10), we see that all realizable 
stabilizing controllers are given by 


K=(1-QG)"'0 (6.15) 


where the ‘parameter’ Q ranges over all proper, stable transfer functions. This 
is the celebrated Youla parametrization of all stabilizing controllers. We shall 
see later that this parametrization can be generalized to unstable plant. 


We can now reformulate our optimization problem with the aid of 
(6.11) as 


minimize sup | W(I—GQ)(ja)| (6.16) 


stable Q 


Note that the use of the Youla parametrization has allowed us to formulate 
the design problem as an almost unconstrained optimization problem (the 
only constraint being that Q be proper and stable) with a relatively simple 
criterion function (Q appears linearly inside the modulus symbols). 

This simple example displays many of the features which occur in 
more general H ,, problems. First, the Youla parametrization plays essentially 
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the same role in simplifying the optimization problem as it does in more 
general problems. Secondly, if we write J=WS for the function whose 
magnitude is being minimized, and J* for this function when the optimal 
controller is in place, then it can be shown that J* is a constant. This is not 
generally true, but it is true that | J*(ja)| is constant, namely that J* is an all- 
pass function. This implies that the choice of the weighting function W is 
crucial in the formulation of problems which are sensible engineering prob- 
lems. Features such as the sharpness of the ‘cut-off’ of | W(j)| will determine 
the sharpness with which | S(jo)| rises (above «,), since | S| will have the same 
shape as |W~'|. 

We also remark on those features which are special to this example. 
First, the optimal solution is not actually realizable because it would require 
an improper controller. This can be avoided by use of a more elaborate 
criterion function—for example, penalizing control energy as well as sensi- 
tivity. Secondly, the problem actually has no solution if the plant has no right 
half-plane zeros, since the (weighted) sensitivity can then be made arbitrarily 
small. This simply shows that in our quest for a simple example we have 
oversimplified the problem, and posed one which is not realistic. Thirdly, the 
problem can be solved without any new theory, other than the Youla 
parametrization (Zames, 1981). This is emphatically not the case for other H „ 
problems. 


6.3 The H,, problem formulation 


6.3.1 Examples of H,, problems 


SENSITIVITY MINIMIZATION 


As a first example, let us simply put the sensitivity minimization example 
treated in Section 6.2 into the general form presented in Section 6.1. To do 
this, we use a familiar manipulation to write 
W(1+GK)1=W[I-GK(I+GK)"'] (6.17) 
Since we require that 
F (P, K)=W(U+GK)"? (6.18) 
we deduce, by comparing (6.17) with (6.3), that we need 


Pau =W, P,,.=— WG, Pa, =l, Paa = —G (6.19) 


The reader can check that this is equivalent to choosing w=d and z= Wy, 


THE H, PROBLEM FORMULATION 271 


where d and y are the disturbance and output variables, respectively, «as 
shown in Figure 6.3. 

Note that this reformulation of the problem remains valid if we 
attempt to minimize || WS ||» for a multivariable plant. In this case P,, and 
P,, will always be square, whereas P,, and P}, will have the same dimen- 
sions as G. This is important because the H,, theory becomes considerably 
more complicated if P,, has more rows than columns (or if P,, has more 
columns than rows), although the algorithms required to compute H,, 
solutions remain similar. 


ROBUSTNESS TO ADDITIVE PERTURBATIONS 


Consider the diagram shown in Figure 6.4: the real plant is modelled as a 
nominal plant G in parallel with an unstructured additive perturbation A. 
The only things known about A are that there is a frequency-dependent upper 
bound on its gain: 


6[A(j@)]<|r(jo)|, for each œ (6.20) 


where r(s) is some (scalar) transfer function; and that the number of right half- 
plane poles of G+ A is known and remains constant as A varies. A controller 
K is to be found which will stabilize the loop for all possible perturbations of 
the plant. The stability of the loop is the same as the stability of the loop 
shown in Figure 6.5 (note that we have used a positive-feedback convention 
in Figure 6.4). If K stabilizes the nominal plant G, so that K(1— GKY! is 
stable, then the small-gain theorem (see Chapter 3) tells us that this loop will 
remain stable if || AK(1—GK)~']|,,<1. Now, 


|AK(I—GK)"*||,,=||rr *AK(UI—GK) € ||% 
SIr Allo lI rKU— GK)? ||, 


<| rK(U—-GKY |l% (6.21) 


Figure 6.4 Feedback around a plant with additive uncertainty. 


272 YOULA PARAMETRIZATION AND H,, OPTIMAL CONTROL 


Figure 6.5 Feedback loop with the same stability as that in Figure 6.4. 


Thus robust stability is assured if |rK(I— GKY + ||» <1. Furthermore, it can 
be shown that if this condition is not met, then there exists a A in the expected 
set of perturbations which will destabilize the system. So to establish whether 
it is possible to achieve robust stability of the plant G in the presence of all 
possible perturbations A, and if so to maximize the stability margin, we 
should solve the H „ problem 


minimize ||rK(I—GK)~'||,, (6.22) 


(over all stabilizing K). 
To put this into standard form, we require that 


F,\(P, K)=rK(I—GK)7? (6.23) 
which we obtain by setting 
Pii =0) ms Pi, =r, Po, =In P2,=G (6.24) 


where we have assumed that G has | inputs and m outputs. 


MIXED PERFORMANCE AND ROBUSTNESS OBJECTIVE 


Suppose now that we wish to obtain good disturbance-rejection per- 
formance, and to maintain stability in the presence of unstructured multi- 
plicative output perturbations. That is, we wish to keep both the sensitivity S 
and the closed-loop transfer function J — S small in magnitude. We cannot do 
both simultaneously, of course, but must emphasize one or other objective at 
each frequency. We can do this by making 


F (P, K)= | a . A (6.25) 


where W, and W, are frequency-dependent weighting matrices; W, would 
typically be chosen by considering frequency-dependent bounds on (A), as 
discussed in Chapter 3. Defining G and K as in Figure 6.3, we see that we 
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should choose 


WwW —W,G 
Pam] ne Pa| ok P,,=I, P3,=—-G 


6.3.2 Performance robustness: an unsolved problem 


It is known that some important design problems cannot be formulated as 
H „ problems. In particular, the problem of achieving performance robust- 
ness, namely the maintenance of performance objectives—not merely 
stability — in the presence of unmodelled perturbations, is one such problem. 
We saw in Chapter 3 that the solution to this problem requires the minimiz- 
ation of the so-called structured singular value u(M) of a transfer function 
matrix M, and that this can be very different from g [M]. However, we also 
saw in Chapter 3 that 


u(M)< inf 6[DMD~"] (6.27) 
DeD 


where D is a set of diagonal, real positive matrices whose structure matches 
that of the plant perturbation. Specifically, let us suppose that the plant 
perturbation has the structure 


A=diag{A,,A,,...,A,} (6.28) 
Then D has the structure 
D=diag{d,I,d,I,...,d,I} (6.29) 


with the dimensions of the blocks in (6.29) matching those in (6.28). Further- 
more, the equality holds in (6.27) if n<3, and numerical experiments suggest 
that (6.27) gives a reasonably tight bound even if n>3. 

This suggests that the performance robustness problem could be 
solved by solving the problem 


minimize ||DF,(P, K)D~'|\,, (6.30) 
K,D 


by iteratively solving for D and K. With D fixed this is the standard H,, 
problem, and with K fixed the minimization over D is convex, so at worst 
it can be performed by search techniques. This approximate solution has 
been implemented by Doyle (1985), who reports good results, but who has 
also discovered a simple example for which such an iterative scheme will fail 
to find the optimum K and D. 
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6.4 The Youla (or Q) parametrization 


6.4.1 Fractional representations 
In Chapter 2 we introduced matrix-fraction descriptions (MFDs) of rational 
transfer functions, of the form 


G(s)=N(s)D~ !(s)=D~ t(s) Ñ (s) (6.31) 


where N, D, Ñ and D, are polynomial matrices, with N and D right coprime, 
and N and D left coprime. We now generalize these ideas to fractional 
representations: every proper transfer function (matrix) G(s) can be written in 
the form 


G(s)=U(s)V~1(s)=V—1(s)0(s) (6.32) 
where U, V, 0 and V, are stable transfer functions, with U and V right 
coprime, and U and V left coprime. Here right coprimeness means that if U 
and V have a common stable right factor X, so that U = WX and V=ZX, 


then X~! is stable, or equivalently, X has all its zeros in the left half-plane. 
(There is a corresponding definition for left coprimeness.) 


EXAMPLE 6.1 
s—2 s+3 
so- 2 = 


_[s—4]>*[ (s—2)(s—4) =| 
|] Ers s+4 


wi 


= s—2 s4+3 1 0 
| s+1 s+4 0 s—4 


s+4 


As an aside, we remark that the mathematical basis of both MFDs 
and fractional representations is the theory of rings. The set of polynomial 
matrices is a ring, and the units of this ring are the unimodular matrices. The 
set of stable transfer functions is also a ring, the units of this ring being the 
minimum-phase transfer functions. Roughly speaking, the idea of a unit is 
that it does not change the properties of an object significantly when it 
multiplies it. Thus we use MFDs when we are not concerned with trans- 
formations that leave the pole-zero properties of a transfer function un- 
changed, and we use fractional representations when we are not concerned 
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with transformations which leave the right half-plane pole—zero properties of 
a transfer function unchanged. Since our principal concern in this section is 
with stability, it is appropriate to use fractional representations. 

A useful characterization of coprimeness is given by: 


Theorem 6.1 (Bezout’s theorem): U and V are right coprime if and 
only if there exist stable X and Y such that 


XU+YV=I (6.33) 


A corresponding theorem characterizes left coprimeness. 

If G has a stabilizable and detectable realization (A, B, C, D), then we 
can obtain right and left fractional representations for it as follows. Let F be 
any matrix such that A + BF is asymptotically stable -that is, F is the solu- 
tion to any state-feedback problem. (Note that we assume that X= Ax + Bu 
and u= + Fx, whereas in Chapter 5 we assumed that u= —K,x. We make 
this change of sign in order to be consistent with most of the literature on 
H „-) Let u, x and y denote the input, state and output vectors of G, as usual, 
and define v=u—Fx. Then 


x=(A+BF)x+ Bv (6.34) 
u=Fx+v (6.35) 
y=(C+DF)x+Dv (6.36) 

so 
u(s)=[F(s!—4A—BF)~'B+I]v(s)=M(s)v(s), M stable (6.37) 

and 


y(s)=[(C + DF)(sI -A—BF) 1B+D]v(s)=N(s)v(s), N stable 
(6.38) 


so that y(s)=N(s)M~ +(s)u(s), and hence G=NM~}. 
To get the other representation, choose any H such that A+HC is 
stable (that is, solve any observer problem). Consider the observer equations 


€¢=(A+HC)C—Hy+(B+HD)u (6.39) 


n=ClC+Du—y (6.40) 
Then 


n(s)= —[C(sI-A—HC)~1H +1 ]y(s)=M(s)y(s), M stable 
(6.41) 
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and 
n(s)=[C(sl -A—HC)"1(B+ HD)+D]u(s)=N(s)u(s), Ñ stable 
(6.42) 


so that y(s)=M~1(s)N(s)u(s), and hence G=M~1N. 

We shall be able to show later that these fractional representations are 
in fact coprime. We shall also see that under certain conditions we can choose 
F (or H) such that N (or N) has the all-pass (or inner) property, namely that 
N"(j@)N(jo)=TI, which is very useful for the solution of the H „ problem. 


6.4.2 Parametrization of all stabilizing controllers 


From now on we assume a positive-feedback convention (as in Figure 6.4). 
We saw in Chapter 2 that internal stability of the closed loop is equivalent to 
the matrix 


I -Ky]"' 
-G I 
being asymptotically stable. 
Theorem 6.2: Let G and K have the fractional representations 


G=NM~!=M~'N and K=UV~!=P~'U. Then the feedback loop 
is stable if and only if both 


M U]7t 
w Ter) 
and 
V —Uy]"! 
a |x | 


are stable. 


Proof: 
(1) 


ak lee a (6.43) 
nvilo vi 


THE YOULA (OR Q) PARAMETRIZATION 277 


I Kt [M O|[M UV! 
= 6.44 
Pee a la (4 
Now, using the fact that the pairs (N, M) and (U, V) are coprime, and 
Theorem 6.1, it is easy to show that 


aes and ae are coprime 
0 V N V a 


K |"! M U]"! , 
Hence | G I is stable only if N y is stable. 


M O0}. M U|"?. JI K . 
Also | A is stable, JF | is stable only "lg | is 


I -KV+. 
stable. The only thing to be shown now is that G I l is 


I KV! 
stable if and only if | G I | is stable. This is most easily done by 


so 


denio then bye f I -K d I K 
considering the transmission zeros o ie I an G iP 


noting that they are the same, since the determinants of both matrices 
are given by det(I —GK).? 
This completes the proof of (1); (2) is proved similarly. | 


Theorem 6.3: With the same notation as in Theorem 6.2, if the 
feedback loop is stabilizable, then M, N, U, V, M, N, U and V can be 
chosen such that 


V -ÖM U] [I 0 
| ally rile J a 
and K=UV~!=V~-'U isa stabilizing controller. 


Proof: Choose M, N, M and Ñ, to be as in (6.37), (6.38), (6.41) and 
(6.42), respectively, so that M has the state-space realization (A + BF, 
B, F, I), N has the realization (4+ BF, B, C+ DF, D), and so on. 

We can obtain a stabilizing controller by using the state- 
feedback matrix F and the observer-gain matrix H in a combined 
state-feedback/observer structure, as shown in Figure 6.6. (Such con- 
trollers are often called observer-based.) 

We have already shown, in Chapter 5, that such a controller is 
stabilizing (except that there we assumed that D =0; the proof follows 
in the same way in the general case). 
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Figure 6.6 An observer-based stabilizing controller. 


From Figure 6.6 we obtain the equations 

€=AC+Bu+H(j—y) (6.46) 
and 

u=F¢ (6.47) 
which can be combined to give 

(=(A+BF+HC+HDF)6—Hy (6.48) 


Now, let u=y—Cf—Du. Then, since f= C¢ + Du, (6.46) becomes 


f=AC+ Bu-Hyu . (6.49) 
=(A+BF)C—Hy (6.50) 
But 
y=Cl+Duty (6.51) 
=(C+DF)C+u (6.52) 
so we have 


y(s)=[—(C + DF) (sl -A— BF) 'H +1 ]u(s)= V(s)u(s), 
V stable (6.53) 
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and 


u(s)= — F(sI— A — BF) 'Hqu(s)= U (s)u(s), U stable 


(6.54) 
so that u(s)=U(s)V~ '(s)y(s), and hence K=UV~?. 
By a dual construction, it can be shown that if 
P(s)= —F(sI—A—HC)~ (B+ HD) +1 (6.55) 
and 
U(s)= —F(sI —A—HC)"'H (6.56) 


then K = V~'U, and U and Ÿ are stable by construction. 
Now, from (6.37), (6.38), (6.53) and (6.54), we see that 


ly 7 


has the state-space realization 


(4428, [B —H], hes ont j T 


and from (6.41), (6.42), (6.55) and (6.56), we see that 


V —U 
-Ñ 


has the realization 


(a+1c, [—(B+HD) H], EE a) 


It is straightforward to show that, if a transfer-function matrix 
X has the realization (A, B, C, D) and D` + exists, then X~! has the 
realization (A— BD~'C, —BD~'!, D~!C, D71). From this we see that 


Isr] 


Za 
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has the realization 


0 
(4+8F-[B -m| 5 4 pari 


[5 Me eae 


-(4+BF-[8 -m| ¢ | t-e+ 40) H], 


[ch L-o 1) 


=(AsHc [-(B+HD) H], H E r]) 


which is the realization given above for 


VY -Ü 
-Ñ M 


Thus (6.45) is verified for the particular fractional represen- 
tations we have constructed. K is stabilizing by construction, so 
Theorem 6.3 is proved. E 


Note that (6.45) can be combined with Theorem 6.1 to prove that all 
four fractional representations which we have constructed for G and K are 
coprime. For example, to show that U and V are coprime, apply Theorem 6.1 
with X= — Ñ and Y=M. 

A most important consequence of Theorem 6.2 is that any controller 
K=UV~'!=V~—'U for which (6.45) holds is a stabilizing controller. We make 
use of this now in proving the central result of this section: 


Theorem 6.4: Let Ky =U V5!=V51Uo be such that (6.45) holds. 
For any QeH.,, (that is, for any realizable, stable Q of compatible 
dimensions), define 


U=U,+MQ, V=V,+NQ (6.57) 
U=0,+0M, V=V,+ON (6.58) 


(1) Then UV-!=V~'U, and K=UV~*= V-'U is a stabilizing 
controller for G=NM~!=M "IN. 
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(2) Furthermore, any stabilizing controller has fractional rep- 
resentations (6.57) and (6.58). 


Proof: 


(1) 
V -—U|[M U] [V¥,+ON -—U,-OM 
-Ñ MIIN V] | -Ñ M 
M U,+MQ 
N V,+NỌ 


(6.59) 
Since (6.45) holds for Uo, Vo, Up and V5, we have 
NM—MN=0 and NU,—MV,=V,M—U,N=I (6.60) 


Performing the multiplication in (6.59) and using (6.60), we obtain 


ý -ÜM U I 0 
lea mlly rile 1 (on 


From the (1,2) block we get VU=UY, so UV-'=V1U. From 
Theorem 6.2 we see that K=UV~'=V~1U is stabilizing. 


(2) Suppose that K is realizable, stabilizes the feedback loop, and has 
the fractional representation K=UV~'=V7!U. 


Consider 


(6.62) 


From Theorem 6.2 we know that this is the product of two stable 
matrices, and so is itself stable. Hence (MV—NU) ! is stable. Let 


Z=MV—NU (6.63) 
and 


Q=M~'(UZ~!—U,) (6.64) 
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(note that MQeH,,). Then 


VotNQ=Vo+NM~*(UZ~!—U,) (6.65) 
=V,+M~!N(UZ~1—U,) (6.66) 
=M~'[MV,—NU,+NUZ~'] (6.67) 
=M-'[14+NUZ~ 4] (6.68) 


(since Uo and V, satisfy equation (6.45)) 
=M~'[Z+NU]z" (6.69) 
=V7* (6.70) 


(from (6.63)). Note that from this we can deduce that NQEH,,. 
From equation (6.64) we have 


U=(U,+ MQ)Z (6.71) 
and combining this with (6.70) we get 
K=UV~!=(U,+MOQ)(V,+NQ)7! (6.72) 


so that ((Uo + MQ), (Vo + NQ)) is a fractional representation of K. 
If we now set U=U,+QM and V= o +QÑ, and proved as 
in the proof of part (1), it will be apparent that V-'U=UV—1. 
The only thing left to do is to prove that QE H „. This follows 
easily, since we have shown that both MQ € H „ and NQ € H „, and N 
and M are assumed to be coprime. a 


Theorem 6.4 gives a surprising and very powerful result. It says that, 
once we know one stabilizing controller for a plant, we can easily generate the 
family of all stabilizing controllers, by means of fractional representations. 
We shall soon see that everything can be expressed in terms of state-space 
models, so that practical algorithms based on this result can be obtained. 

But first we shall explore some implications of Theorem 6.4. We begin 
by obtaining an expression for the controller’s transfer function K in terms of 
Ky and Q: 


K=(U,+MQ)(Vo+NQ)"* 
=(Ko+MOV5')\U+NOQV5')7! (6.73) 


=(Ko+ MOV ÐU- NQ +V NQ) Vot] i (6.74) 
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Figure 6.7 Controller structure if the plant is stable. 


(by the matrix-inversion lemma) 


=K,)—K)NQU+Vo!NQ)'Vo'+MQV5'U+NQV5')7* 


(6.75) 
=Ko—(Ko NQ- MQ) +V NQ) V5! (6.76) 

(since Vg (I +NQV3 +) =U +V NQ) VS) 
=Ko—(UoV3!N-M)QU +V NQ) V5? (6.77) 


But 

Uo Vo N-M=V3'ÛŪ,N-M= 5 ÛŪ,N—- Va M)=— 73! (6.78) 
(by equation (6.45)). So 

K=Kyo+V5'QU+V5'1NQ)'V5! (6.79) 


If we suppose temporarily that the plant G is stable, then we can take 
N=G, M=I and K,=0; the latter has representations Uy=U,)=0 and 
Vo=, =I, for example. In this case (6.79) simplifies to K=Q([+GQ)"}, 
from which we obtain Q=K(I—GKY t, which is precisely the ‘Youla 
parameter’ Q which we used in the sensitivity-minimization example in 
Section 6.2 (except for a sign change, the result of using a negative-feedback 
convention in the example, and a positive-feedback convention here). From 
the expression for K we can see that K can be obtained as the feedback 
connection of Q and —G, as shown in Figure 6.7. This shows that the 
controller is cancelling the plant’s dynamics by putting — G in parallel with 
the plant’s transfer function G, and inserting arbitrary stable dynamics, so 
that u= Qr, where r is the ‘reference input’. 

But (6.79) shows that things get more complicated if the plant is 
unstable. Why is this? In particular, why will the simple scheme of Figure 6.7 
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Figure 6.8 Controller structure if the plant is unstable, but can be stabilized bya 
stable controller Ko. 


fail? The reason is clearly that the instability in G only ‘disappears’ from 
certain transmission paths—in particular, from the path between r and u. 
Using the controller shown in Figure 6.7, we would have, for example, that 
y=G¢Qr, which would be unstable. 

But suppose that we first stabilized the plant using a (feedback) 
controller Ky, and then applied the simple scheme of Figure 6.7? What 
structure would our controller have then, in terms of the original plant? It 
turns out that if Ko can be chosen to be stable, it would have precisely the 
structure given by (6.79)! We shall show this shortly, but we first note its 
implication for design methods. One might suspect that following a two-step 
design procedure, stabilizing the plant in the first step and then modifying the 
dynamics of the stabilized plant in the second, would not allow all possible 
designs to be achieved: the first step might in some way limit the design 
freedom in the second step. Theorem 6.4 (and its consequences) shows that 
this is not the case if the plant can be stabilized by a stable controller. 

Figure 6.8 shows the controller which results from such a two-step 
procedure. The stable controller Ko is used to stabilize the plant. This yields a 
modified system, with transfer function (from u to y): 


H=G(I—K,G)"! (6.80) 
The scheme of Figure 6.7 is now used to construct a controller for H which 
consists of the feedback connection of Q € H „ and — H. The transfer function 


of the complete controller is 


K,=K,+Q(1+H@Q)"' (6.81) 


THE YOULA (OR Q) PARAMETRIZATION 285 


and 
H=NM"'(I— z 'Ū, NM)! (6.82) 
=N(M— V3 ' ÛN)! (6.83) 
=NP, (6.84) 


(by 6.78). Hence we have 
K,=Ko+Q(I+NV,.Q)"', QeH, (6.85) 
Now consider the controller K defined by (6.79), which we rewrite as 
K=Kyt+Vo10Vo1U+NOV,'), QeH, (6.86) 


Remember that we have assumed that K, € H „; but this implies that V5 + and 
V5teH„, so that 


R=V,'QVo1eH,,, if and only if QeH,, (6.87) 


(The ‘only if? comes from the fact that Voce H, and PeH.) Given any 
QeH.,, in (6.86) we can obtain the corresponding Re H,, from (6.87), and 
given any Re H „, we can obtain the corresponding Q €H „ from 


Q=V RV, (6.88) 
We can therefore rewrite (6.86) as 
K=K,+R(UI+NV,R)"! (ReH,) (6.89) 


which has precisely the same form as (6.85). Thus we have shown the 
equivalence of (6.79) to a controller obtained by a two-step method, provided 
Ko can be chosen to be stable. 

Some plants can be stabilized only by an unstable controller Ky, in 
which case the scheme shown in Figure 6.8 will fail because the instability 
in Ky may show up at some nodes in the system. For this general case, 
Figure 6.9 shows a block-diagram representation of (6.79). Here the dashed- 
line rectangle encloses that part of the controller which remains fixed while Q 
is allowed to vary over H,, to obtain all possible stabilizing controllers. 


6.4.3 All stabilizing controllers are observer-based 


Here we show that all the controllers described by (6.79), and hence all 
stabilizing controllers, have observer-based state-space realizations. We have 
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Figure 6.9 General structure of stabilizing controllers (Qe H, ). 


already obtained realizations of N, Ko, Vo and Ñ, (in equations (6.38), (6.47), 
(6.48), (6.53) and (6.55)). We begin by assembling these together to obtain a 
realization for the fixed part of the controller, enclosed by the dashed-line 
rectangle in Figure 6.9. Let this fixed part have transfer function J, so that the 
controller can be redrawn as in Figure 6.10. Then, in the notation introduced 


in Section 6.1, 
K=F\(J,Q) om) 


and (from Figure 6.9) 


Ky Vo! 
= 6.91 
7 P - Ps oe 


Figure 6.10 As Figure 6.9, but with the system enclosed by the dashed-line 
rectangle represented by /. 
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Using the formula for the realization of an inverse system (see Section 
6.4.2), it is easy to show that Vg * and Vo‘ have the following realizations: 


Vo':(A+BF+HC+HDF, H, C+DF, 1) (6.92) 
Vo1:(A+BF+HC+HDF, B+HD, F, I) (6.93) 
and hence Vj +N has the realization 


= A+BF+HC+HDF H(C+DF) HD 
Vo N: R 5 
0 A+BF B 


[C+DF c+DF},D) (6.94) 


The change of state coordinates 


shows that an alternative realization is 


pe. A+BF+HC+HDF 0 B+HD 
ways 0 A4+BF |’ B i 


[C+DF 0], p) (6.95) 


which is unobservable, and can therefore be simplified to 
Vo'N: (A+BF+HC+HDF, B+HD,C+DF, D) (6.96) 


Putting together equations (6.47), (6.48), (6.92), (6.93) and (6.96), we obtain a 
realization for J: 


F 
J: (4+BF HC+HDF,[—-H B+HD], ; 
\ j i L FHD] b 


an 6.97 
i-o) 7 


If we let the inputs to J be the vector [y" y1]" and its outputs be [u? uT]", as 
shown in Figure 6.10, then the state-space equations corresponding to (6.97) 
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Figure 6.11 Alternative representation of all stabilizing controllers (Qe Ha). 


are 
(=(A+BF+HC+HDF)(—Hy+(B+HD)y, (6.98) 
u=Fl+y, (6.99) 
uy=—(C+DF)C+y—Dy, (6.100) 


It is straightforward to check that (6.98) and (6.99) are the equations of 
an observer-based controller, as shown in Figure 6.6, which has been modi- 
fied by adding an additional signal y, to the controller output u. The 
additional term (B+ HD)y, in (6.98) ensures that the observer is driven by the 
same control signal as the plant. Equation (6.100) reveals that the signal u; is 
just the difference between the estimated and the actual plant output (the 
innovations signal, if the observer is a Kalman filter). But y, =Qu,, so the 
structure of every stabilizing controller has the form shown in Figure 6.11: an 
observer/state-feedback structure, with an arbitrary, stable, proper transfer 
function Q inserted as shown by the dashed line. (Compare Figure 6.11 with 
Figure 5.2.) S 

Yet another interpretation of the structure of stabilizing controllers 
can be obtained as follows. Let x, be the state vector of a realization of Q. 
Then we can supplement (6.98)—(6.100) with the equations 


X= AX + B4; (6.101) 


Yı =C,xX,+ Duy (6.102) 
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To simplify the algebra, we shall assume that the plant is strictly proper, 
namely that D=0. However, we shall retain D, # 0. 
If we eliminate u, and y, from (6.98)—(6.102), setting D =0, we obtain 


(=(4+ BF +HC-—BD,C)6+BC,x,+(BD,—H)y (6.103) 
X,=A,x,—B,CC+B,y (6.104) 
u=(F—D,O)C+C,x,+D,y (6.105) 


A 0 B 
Now, it we let x=] | ; l | B=] l 
Xq 0 A, 0 


C.=[C 0, FiF Cl e 


then (6.103)-(6.105) can be rewritten as 
Xe=(A,+B.F.+H.C,—B.D,C,)x,.+(B.D,—H,)y (6.106) 
u=(F,—D,C,)x.+D,y (6.107) 


This gives a state-space realization of the complete controller K (complete in 
that it includes the dynamics of both J and Q). Note that K is strictly proper if 
and only if D,=0, namely if and only if Q is strictly proper. 

If D,=0 then (6.106) and (6.107) are the equations of an observer- 
based controller for a plant with realization (A,, Be, C,, 0). But 


C,(sI—A,)~ 1B, =C(sI — A)" 1B = G(s) (6.108) 


so (6.106) and (6.107) are actually the equations of an observer-based con- 
troller for the original plant, with its dynamics augmented in such a way that 
its transfer function is unchanged. Hence we conclude that all strictly proper 
stabilizing controllers have the structure of an observer-based controller for 
an augmented model of the plant. 

If D, #0, then we retain the observer-based structure, with the ad- 
dition of a signal path through D,, as shown in Figure 6.12. Note that this 
controller is stabilizing with any D,, provided F, and H, give a stable 
observer/state-feedback combination. (Exercise: Why?) 


6.4.4 Parametrization of closed-loop transfer functions 


To investigate those closed-loop transfer functions which appear in H 5 
problems, we need to return to Figure 6.2 and exploit the structure which the 
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Figure 6.12 State-space representation of all stabilizing controllers for strictly 
proper plant. 


controller possesses. Figure 6.13 shows the original scheme of Figure 6.2, 
suitably redrawn. Note that P,,=G (since we are using a positive-feedback 
convention; it would be’ P,,= — G otherwise). Let us combine P and J into 
the transfer function 7, as shown in Figure 6.14. (Warning: In earlier chapters 
we have used the symbol ‘T’ to denote something else.) We have 


z= FP, K)w (6.109) 
= F(T, Q)w (6.110) 
=[T; +T 0U- T20) Ta ]w (6.111) 


We shall show that T33 =0, so that (6.111) simplifies to 


z=[T;; + T120 Ta |w (6.112) 


Figure 6.13 As Figure 6.2, but with the controller represented as in Figure 6.10. 
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Figure 6.14 As Figure 6.13, but with the feedback connection of P and J rep- 
resented by T. 


which shows that, even in the general case, closed-loop transfer functions are 

affine in the Youla parameter Q. Note (from Figure 6.9 or 6.11) that u, has the 

same dimension as y, and y, has the same dimension as u. Consequently T,,, 

T,, and T,, have the same dimensions as P,,, P,, and P,,, respectively. 
To show that T,,=0, we use Theorem 6.4 to obtain 


K(I—P.K)"* =(Uo +MQ)(Vo + NQ) ' 
x [I-M~*N(Uy+MQ)(Vo+NOQ)~*]7! (6.113) 


=(Uo+MQ)[(Vo+NQ)—M~1*N(U,+MQ)]"! 


(6.114) 
=(U)+MQ)[(MV,—NU,)+(MN—NM)Q]~'M 
(6.115) 
=—(U)+MQ)M (6.116) 
in which the last step follows from (6.60). From this we get 
F\(P, K)=P,,—P,.(Up + MQ)MP,, (6.117) 
=(P1,—P,,UpMP,,)—P,,MOMP,, (6.118) 
Comparing this with (6.111), we obtain the correspondences 
Tı =P, — P 2U MP, 
Tı2=—P,:M, Ta =MP,,  T,,=0 (6.119) 


Another way to see that T,, =0 is to redraw Figure 6.11 in the form of 
Figure 6.15, with Q(s) omitted. Now T,, is just the transfer function between 
yı and u,. A standard result of observer theory, and one which we also came 
across in Chapter 5, is that this transfer function is zero since the state 
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Figure 6.15 Plant with stabilizing controller as in Figure 6.11, but with Q omitted. 


estimate Č is uncontrollable from y,, and the plant’s state is unobservable 
from u, (see Exercise 6.5). 

This proves the main result of this section. 

In order to be able to solve H „ problems using state-space algorithms, 
we need state-space models of 7,,, Tiz and T,,. We shall assume that a 
(minimal) realization for P is 


; C, Dy, Dy 
(aw aE: Se) emn 


so that P;()=C;(sI—A)+B;+D; i,j=1,2 (6.121) 


It can be shown that, if F is a stabilizing state-feedback matrix for the plant 
(that is, if A+ B,F is stable), and H is a stabilizing observer-gain matrix for 
the plant (i.e. if A+ HC, is stable), then realizations of T,,, T12 and T,, are 
given by 


T E A+B,F —B,F B, 
a 0 A+HC, | | B,+HD3, |’ 


[C,+D,.F — DF], Da.) (6.122) 
T 131 (A+B3F, By, Cy +D12F, D33) (6.123) 
T 1: (A+ HC), Bı +HD3,, C2, D21) (6.124) 


Note that these are all stable. 
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This concludes our rather long section on the Youla parametrization. 
We can now use the results we have developed, particularly (6.112), to solve 
the H,, problem. 


6.5 Solution of the H,, problem 


6.5.1 Equivalence to the model-matching problem 


In Section 6.1 we formulated the general H „ problem as 
minimize || F\(P, K)||,, 


over stabilizing compensators K. 
Combining Theorem 6.4 with (6.112), we can rewrite this as 


minimize || 7,,+7,.0T2,||, 
QEHo 


This is sometimes known as the model-matching problem because to solve it 
we need to choose Q such that 7,,QT>, ‘matches’ the ‘model’ — 7, as well as 
possible. 

If T,, and T,, are square, it looks at first sight as if we should be able 
to set Q= — T> T, T3, , and the problem would be solved. In practice this 
is never possible because the resulting Q is neither proper nor stable. For 
illustration, consider the sensitivity-minimization example presented in 
Section 6.2. We saw in Section 6.3.1 that this problem corresponds to 
choosing 


Pia =W, Pi2= — WG, P251 


Since G is assumed to be stable, we can take T,,=P,,, Ti2=P;z and 
T21 =P», in this case (see Exercise 6.3) which gives 


-Ti T Tz =G! 


This is unstable, if G is assumed to have a right half-plane zero. Even if this 
assumption is dropped, G~' is not proper if we assume that G is strictly 
proper. 

i To obtain a solution of the model-matching problem, we shall show in 
the next section that it is equivalent to yet another problem, the so-called 
‘Hankel approximation problem’, for which a complete solution algorithm 
exists and will be given in Section 6.6. 

Once a solution to this problem is obtained, in the form of a state- 
space realization of Q, the required controller can be obtained (in state-space 
form) using equations (6.103)—(6.105). 
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6.5.2 Equivalence to the Hankel approximation problem 


In this and subsequent sections we shall use X*(s) to denote X"(—s). We 
shall usually drop the argument (s), as elsewhere in this chapter. 

Let us assume, for the time being, that T,, and T}; are square and all- 
pass (or inner), namely that 


T2 Tř, = I and Ts; Tai = I (6.125) 
It is easy to show that, if X and Y are all-pass, and o(.) denotes any singular 


value of (.), then o(X AY)=o(A), and that, for any A, o(A*)=0(A). Hence we 
have 


lTi + Ti20Tz lo = ll Mi2(Th T11 T31 + Q)T 21 Ilo (6.126) 
= || Tý Ti T31 +Q] (6.127) 
=|| T2 TT, T12 +0* |o (6.128) 


Since Qe H„ and 7,,¢H,,, Q* has only unstable poles, while T3, 77,712 
potentially has both stable and unstable poles. 

It can be shown, by a rather complicated analysis (Limebeer and 
Hung, 1987), that T,, 7%, T} in fact has only stable poles. If we define 


R=T,,77, Ti2 (6.129) 
then 

min || Tii +T120Tz1 llo = min ||R+Q*ll. (6.130) 

QeH,, QeH,, 


which converts the model-matching problem into the problem of approxi- 
mating a stable transfer function (R) by an unstable one (—Q*). This is 
known as the Hankel approximation problem, or the Nehari extension 
problem. 

We made two assumptions about 7,, and T,, to obtain (6.130): that 
they are square and all-pass. Somewhat surprisingly, the second assumptior 
is not at all restrictive because it is usually possible to choose F and H in 
(6.123) and (6.124) so that the assumption is satisfied (see Exercise 6.8). It is 
always possible to find a suitable F and H if P4, and P,, are square, and P is 
stabilizable. 

The validity of the first assumption, however, that T,, and T3, are 
square, is determined entirely by the problem statement, and is violated by 
some important classes of problem — for example, the mixed performance and 
robustness problem described in Section 6.3.1. 
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6.5.3 1-block, 2-block and 4-block problems 


If T,, has more columns than rows, and T,, has more rows than columns, 
then there is no problem, because T¥, and T$, can still be found such that 
equations (6.125) hold. The whole of the development given above is still 
valid. 

More commonly, though, T,» has more rows than columns, or T3, 
has more columns than rows, and things then become more complicated. It is 
no longer possible for (6.125) to hold; however, it is always possible to find 
transfer functions T,,, and T,,, such that [T,, T121] and [T], TA] are 
square and all-pass. Hence we have that 


It T. 
| ltrs Ty2,]=I and | Ja Tžu]=I (6.131) 
Thai Tail 
Proceeding as above, we obtain 


lTi + TiTa llo = 


Tř, 
E [en 1+7,2.07T2,)[73, TŽ] 
121 


æ% 


(6.132) 

_ || Th. å Q 0 
EA Ta [T3 raele I . (6.133) 
= lars oe | (6.134) 


where 


Ry, =T{ T, T3, Ri2=T{2T1 7H, 
Ra, = Th. T,,T3,, Ry2=Thi Ta T$ 


If T3, is square (or has more rows than columns), then T,;, does not 
exist, and (6.134) simplifies to 
i Tai J 
R21 


and if T13 is square (or has more columns than rows) then T, 21 does not exist, 
and (6.134) simplifies to 


I LP, +7T;2,Q0T>, Ilo = (6.135) 


leo) 


lTi + T2072: llo =lER + Risin (6.136) 


In the literature on H „ control theory, problems which result in the formu- 
lation (6.130) are often called 1-block problems, while those which result in 
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(6.135) or (6.136) are called 2-block problems, and those which result in (6.134) 
are called 4-block problems. 

It is possible to develop solutions to all these problems along similar 
lines to that taken above for the 1-block problem (Doyle, 1984), but this 
approach would lead to unnecessarily complicated derivations for the 
2-block and 4-block problems, and to very inefficient solution algorithms. It 
is better to use an alternative algorithm, which is given in Section 6.7 without 
any derivation. That algorithm is in fact to be preferred for all three kinds of 
H „ problem, although a workable algorithm is obtained for the 1-block 
problem by solving (6.130), using the results obtained in Section 6.6. 

The minimum achievable value of || 7;,+7,.QT2;||.. can be com- 
puted for 1-block problems, and the optimal value of Q can be obtained by a 
single application of the appropriate algorithm. This is not so for 2-block or 
4-block problems, for which one must choose some real positive scalar y, 
apply the algorithm to discover whether there exists a Q such that 
| T11+71220T21 llo <7, and if so obtain the corresponding compensator. An 
optimal controller is found by iteratively searching over values of y. There is 
some reason to believe that such an iteration is inevitable in any algorithm for 
solving general H „ optimization problems. 

It should be borne in mind, however, that often we do not really need 
to minimize || T;, + 712271 ||, but just keep it below some particular value. 
This is true of the robustness problem treated in Section 6.3.1, for example. If 
we simply wish to obtain || 7; , + 7122721 Ilo <m, say, then we need only take 
y=m, and check whether a solution exists (and if so, find it). In this case no 
iteration is involved. 


6.6 The Hankel approximation problem 


6.6.1 The Hankel norm 


In Section 6.5.2 we saw how to convert 1-block H „ problems into so-called 
Hankel approximation problems which take the form 


minimize ||G+Y*||,, GeH, (6.137) 
YeH 


In this section we shall show how to solve such problems, again using state- 
space algorithms. Our treatment will therefore be limited to rational G, and 
we shall assume that we have a realization G: (A, B, C, D) available as ‘input’ 
to our algorithms. 

First we need to define the Hankel norm of G, which will be denoted by 
|G lly. We assume that GeH,,, and we let P= P™ and Q =Q" be the 
solutions of the Lyapunov equations 


AP + PA‘™ + BB" =0 (6.138) 


THE HANKEL APPROXIMATION PROBLEM 297 


and 
A'Q+Q0A+C'C=0 (6.139) 


(P and Q are the controllability and observability gramians, respectively). 
Then the Hankel norm of G is 


IG liu = E(P)? ] (6.140) 


that is, it is the spectral norm of (PQ)'/?. A more informative interpretation of 
|| G || is the following. We can define the Hankel operator corresponding to G 
as the following operator on functions v(t): 


oo 


(Tgv)(t) = | Cexp[A(t+)]Bo(t)dt (6.141) 


0 


Note that this is not the same as the familiar convolution operator, because of 
the + sign in the kernel. Suppose that we have an input u(t) which is non- 
zero only for —œ <t <0, and let the resulting output over the interval 
0 <t < © be y(t). Then Tg is the operator which maps u(t) into y(t); that is, it 
maps past inputs into future outputs. This can be shown as follows. 

Let v(t) = u(—t)(0 < t< œ). Then 


(Ugv)(t) = |. Cexp[A(t+1)] Bu(—1t)dt (6.142) 
(0) 

= | i Cexp[A(t—1)]Bu(t)dt ` (6.143) 

= y(t) (6.144) 


(since u(t) = 0 for t > 0). 

The Hankel norm || G ||, can be shown to be the largest singular value 
of the Hankel operator Tg. We shall not explain here what is meant by a 
singular value of an operator, but remark that it can be given a ‘largest-gain’ 
interpretation in the following sense: 


1 Gig = sup OL ae t<0 tus 


lul lut)=0, t>0 


Note that only the dynamic part (A, B, C) of G influences the Hankel 
norm, which is independent of the matrix D. 

We shall now state a theorem which places a lower bound on the 
achievable error in the Hankel approximation problem, in terms of the 
Hankel norm. In the next section we shall show that a particular approxima- 
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tion achieves this lower bound, and hence solves the Hankel approximation 
problem. 


Theorem 6.5 (Glover, 1984): Given rational GeH,,, then, for any 
YeH,, 


IGlu<||G+Y*l. (6.146) 


Proof: Omitted — see Glover (1984). 


6.6.2 Glover’s algorithm 


We now present an algorithm, developed by Glover (1984), for solving the 
Hankel approximation problem 


minimize || G + Y*||,, GeH, (6.147) 


Yelo 
when G is square. 


Step 1 Obtain a balanced realization of G: (A, B, C, D), namely one such that 
the Lyapunov equations 


AZ + AT + BBT=0 l (6.148) 
and 


ATE +ZA4+C7C=0 (6.149) 


have a common diagonal solution 


cl 0 
= 6.150 
where X, =diag{o,,...,0,),ando,>06,>... > C, > O. It can be shown 


that such a balanced realization always exists (if GeH œ); for a method of 
finding it see Exercise 6.13 and Section 8.3.9. 
Note that 


Gila = oy (6.151) 


Step 2 Partition A, B and C conformally with È: 


Ai Ay B, 
A= A C a orks 6.152 
be ral B [C, C3] (6.152) 


THE HANKEL APPROXIMATION PROBLEM 299 


Let T = £} — g} I (where the unit matrix has the same dimensions as £ 1—not 
usually the same as the unit matrix appearing in (6.150)), and choose a U such 
that UU" = I and B, = — CĪ U; sucha U always exists, since B, BT = CTC. 


Step 3 Define the following matrices: 


A=T~'(63 Al, + ©, Az.E, — 0, CT UB!) (6.153) 
B=T t (2, B, +0, CTU) (6.154) 
C= -C, E; — o, UBI (6.155) 
D=—D+6,U (6.156) 


and let Y,,, have the realization (— 47, — CT, BT, DT). 


Then we have 


Theorem 6.6: 
(1) Yon EH o (6.157) 
(2) IG + Yll = IG liu (6.158) 
3) (G+ Y4)(G+ Y$) = 03 (6.159) 

REMARKS 

(1) Parts (1) and (2) of the theorem, together with Theorem 6.5, show that 
Y, is a solution to the Hankel approximation problem. Part (3) 
shows that the approximation error G + Y* is an all-pass function, 
and hence that the magnitude of the error is the same at all fre- 
quencies. In applications to H,, problems this has the consequence 
that o[ F,(P, Kop) (j@)] is constant. 

(2) As remarked earlier, when solving H „ problems we may be satisfied 


by finding a YeH,, such that |G+ Y*||„ <m, where m is some 
specified real number, and not be particularly interested in finding the 
optimal Y which minimizes ||G + Y*||,,. A suitable Y will exist if and 
only ifm > c, and can be found from the following realization for Y*: 


In ==2—m?] (6.160) 


Â= T; (mAT + DAD) (6.161) 


B=T7'EB (6.162) 
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C= -CE (6.163) 


D= -D (6.164) 


Safonov et al. (1987) have pointed out that in this case there is no need 
to find a balanced realization: it is sufficient to find the controllability 
and observability gramians P = P" and Q = Q7 which solve the 
Lyapunov equations 


AP + PAT + BB™ =0 (6.165) 
and 


ATQ+Q0A+C'C=0 (6.166) 
for any realization of G, and then replace (6.160) by 
Tl, =QP—m’!I (6.167) 


and replace © A in (6.161) by QAP, 2 in (6.162) by Q and © in (6.163) 
by P. 


Proof: The proof is omitted. It is long but quite straightforward, and 
is given by Glover (1984). 


Theorem 6.6 assumes that G is square. If it is not square, we 
augment G with rows or columns of zeros to make it square. Suppose that 
we need to augment it with columns, so that Gq = [G 0] is square. 
We then use Theorem 6.6 to obtain a square matrix YeH,, such that 
Gog + Y* llo = Il Geq lla = || Gly. Now partition Y so that 


lGa + ¥*.= [G+ YT 0+ Yo (6.168) 
= (Glu 
Then 
IG + Y{ llo < IG lla (6.169) 


and hence, by Theorem 6.5, 
IG+ Y{ llo = |G lln (6.170) 
Therefore Y, is a solution to the Hankel approximation problem. 


The theory and results of the Hankel approximation have a number of 
applications in addition to solving H.,, problems. The principal one is to the 
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approximation of high-order state-space models by simpler ones (Glover, 
1984). 


6.7 The Glover—Doyle algorithm for general 
H,, problems 


The algorithm to be presented in this section is taken from Glover and Doyle 
(1988); a proof of its validity is outlined by Doyle et al. (1988). At the time of 
writing these results are very new, and several alternative proofs are being 
developed. It is too early to say which of these will eventually emerge as the 
shortest, clearest or most elegant. 


The algorithm yields a family of stabilizing controllers K, such that 
| Fi(P, K)o <7 (6.171) 


for some chosen value of y, if any such controllers exist. We assume that a 
realization of P is given by (6.120) and, -using earlier notation, that the 
dimensions of P,,, Pi2, P21 and P,, are py x m,, py x mz, pa x m, and 
Pz X mz, respectively. Furthermore we assume that (A, B3, C2, D32) — namely 
the plant — is stabilizable and detectable, so that stabilizing controllers exist, 


and that rank (D,,) = m, and rank(D,,) = p2, in order to ensure the realiz- 
ability of controllers which solve (6.171). 


To simplify the notation, we perform non-singular transformations on 
u and y (see Figure 6.2): 


i=S,u, jf=S,y (6.172) 
and unitary transformations on w and z: 
w= T,w, Z= 7,2 (6.173) 


(TTY = I = T,T®). This gives 
z Pa A | 
A J a.) eth, “Ee 6.174 
5] H P > u ¢ ) 
a | PaT} T,PiaS Ww 
~ Be TË S, Pi St |e O72) 
and, with a feedback controller K = S,KS, * in place, 
Ww = F\(P,K)z = [T, F\(P,K)T#]Z (6.176) 


Since T,, and T, are unitary, we have 


I Fi(P,K) lo = |F,(P, K) ec (6.177) 
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We can therefore replace P by P, find a suitable controller K, and then obtain 
the required controller 


K =S,'KS, (6.178) 


Now, the point of these transformations is that we can choose T, and S, such 
that 


~ 0 
Di2 = T,D,28,* = H (6.179) 


and 7, and S, such that 

Dy, = S,Da, T” = [0 1] (6.180) 
One way of finding the required transformations to achieve this is to use the 
singular-value decompositions of D,, and D3, to obtain row and column 
compressions, as explained in Chapter 8. Since this can always be done 
(because of our assumptions about the ranks of D,, and D3, ) we shall assume 
that these transformations have already been performed, and therefore that 


D, =[01]" and D,,=[07] (6.181) 


We assume that D,, is partitioned as 
D 
Dam | pe os (6.182) 


where D1122 has m, rows and p, columns. 
For the time being we assume that the plant is strictly proper, namely 


that 
D,,=9 (6.183) 
Finally we assume that 
A—jol B, 
k = - 6.184 
ran | C, “a n+my, ( ) 
and 
rank =n+p (6.185) 
| C, Day i 


for all real œ, where A has dimensions n x n. 
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It is useful to introduce the notation 


Dig =[Di1 Di2] and Dy, = [D], D} T (6.186) 
and define 
R =D], D, = ies J (6.187) 
0 0 
and 
RaDa K a (6.188) 


Now we define X,, and Y,, to be the stabilizing solutions of the 
following Riccati equations: 


X.(A—BR*D},,C,) +(A—BR'D},C,)'X., 


— X „BR !B"X „+ Cl(I—D,,R-'D1,)C, =0 (6.189) 
Y„(4— B, DI, R~1C)" +(4A— B, DI, RC) Y, 
—Y,,CTR™'CY,, + B,(I— Dt, R-'!D,,) BY =0 (6.190) 


By ‘stabilizing’ solutions we mean that X,, = XT and Y,,=Y1,, and the 
matrices A + BF and A + HC have all their eigenvalues in the open left half- 
plane, where F and H are defined by (6.191) and (6.192) below. 

Equations (6.189) and (6.190) have the same form as the Riccati 
equations encountered in Chapter 5, except that the constant and quadratic 
terms are generally not positive-semidefinite. For example, BR~' BT, which 
appears in the quadratic term in (6.189), may be indefinite if y is large enough 
(see equation (6.187)). Because of this, stabilizing solutions to these equations 
may not exist; non-existence of one or both solutions indicates that there is no 
stabilizing controller which satisfies (6.171). The equations can be solved 
using the algorithms given in Chapter 8; more specific algorithms have been 
developed recently which exploit the special form of (6.189) and (6.190), but 
those described in Chapter 8 are sufficient in most cases. 

Having obtained X „ and Y,,, we define 


F = —R~1(DT,C, + B'X,) (6.191) 


H = —(B,D', + Y, C7) RT! (6.192) 
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and partition them: 
F=[Fi, Fi, F3 (6.193) 
H=[H,, H,, H3] (6.194) 


with F,,, F,, and F, having m, — p2, p2 and m, rows, respectively, and H,,, 
H, and H, having pı —m,, m, and p, columns, respectively. 

Now we can state the main result (in which p(.) denotes the spectral 
radius): 


Theorem 6.7 (Glover and Doyle, 1988): 


(1) A stabilizing controller exists, such that || F,(P, K) || < y, if and 
only if 


(a) y> max{ē[[Di111 D112] ELDI Di] (6.195) 


and 


(b) there exist solutions X „ > 0 and Y,, > 0 of (6.189) and (6.190), 
respectively, such that 


P(X o Yo) <7? (6.196) 


(2) If (a) and (b) above are satisfied, then all (rational) stabilizing 
controllers K, for which || F,(P, K)||,, < y, are given by 


K = F,(K,,®), (6.197) 
for any rational beH, such that |}®||,,<y where K, has the 
realization 

K; (4. 1B, alel Bt Jail (6.198) 
and 

Dy, = — D121 Do yi (YI — D1111 D1111) Diii — D1122 


(6.199) 


D> and D>; are any matrices (of size m, xm, and p, X po, 
respectively) such that 


D,,D1, = T—Dyyai (yI — Dis, D1111) Dii (6.200) 


Di; Dz; = I— D] (° = D1111 DIa) Daria (6.201) 
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(these can be obtained by Cholesky factorization, for example — see 
Golub and Van Loan, 1983); the remaining matrices in (6.198) are 


given by 
B, = (B3 + H,2)D,3 (6.202) 
Ĉĉ, = — D, (Cz + Fy2)Z (6.203) 
B,= —H, + B, Di2 D; (6.204) 
Ĉĉ = F,Z + D, Dz} Ĉ, (6.205) 
Â=A+HC+B, D} Ĉ, (6.206) 
where 
Z=(I-5 Y, X)! (6.207) 


One member of the family of solutions is obtained by taking ® = 0, 
and this is called the central or maximum-entropy controller. It has the 
realization 


K: (A, B1;C1, D11) 
and so has the same state dimension as the realization used for P. 


The above algorithm assumes that D,, = 0, namely that the plant is 


strictly proper. If this is not the case, it is sufficient to do the following. Apply 
the algorithm to 


A 0 0 
p=P-| i al (6.208) 


and obtain a controller K. Then form the controller 

K=K(I+ Da Ky! (6.209) 
Figure 6.16 shows the structure of P and K in terms of P and K. It is 
straightforward to show that K stabilizes P if K stabilizes Ê (from 
Figure 6.16, for example), and that 


F,(P, K) = F,(P, K) (6.210) 


The only slight complication is that any controller K(s) for which 
det (7 + Dy) K(o)) = = 0 is not admissible. 
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Figure 6.16 Controller for a plant which is not strictly proper. 


To find a controller which minimizes || F,(P, K) || it is necessary to 
use the algorithm iteratively, reducing y until the limiting value yg is reached, 
such that p(X » Y..) = y4, or until one or other of the two Riccati equations 
(6.189) and (6.190) fails to have a positive semi-definite solution. 

It can be shown that, if y is increased to very large values, then the 
central controller generated by the Glover—Doyle algorithm converges to the 
LQG controller which minimizes || F,(P, K)||,. There are many connections 
between the H „ theory and LQG(H,) theory, some of which are highlighted 
by Doyle et al. (1988). The most obvious similarities are that both approaches 
require the solution of two Riccati equations, and both yield a controller with 
the same state dimension as the original problem description -namely the 
state dimension of the plant, together with any augmenting systems. 


6.8 Design example 


6.8.1 Design specification 


We apply the H „ design method to the AIRC model used in earlier chapters 
and defined in the Appendix. The design specification is essentially the same 
as in Chapter 5: a closed-loop bandwidth of about 10rads~‘, with reason- 
ably damped responses, and zero sensitivity at zero frequency. Since we have 
the benefit of having already found some controller designs for this plant, we 
can be more specific and aim to improve on the LQG/LTR design we 
obtained in the previous chapter. 

In particular, we shall see whether we can keep max(|S||,,. {| T ||.) 
below the level achieved with the LQG/LTR controller (where S is the 
sensitivity and T the closed-loop transfer function), while reducing o(S) more 
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quickly below 10 rads~', and reducing (7) more quickly above 10 rads~'. 
In order to do this, we shall solve the mixed performance and robustness 
problem defined in Section 6.3.1; that is, we take 


F,(P, K) = ie Z| (6.211) 


We choose the weights to be 
W,(s)=w,(s)Iz and W,(s)=w,(s)I; (6.212) 


where w,(s) and w,(s) are scalar transfer functions. 
The functions 


(s + 6)? 


w,(s) = s(s + 0.6) (6.213) 


and 
10 50 
W2(s) = Eee I io) (6.214) 


penalize S and T, respectively, slightly more severely than the weights which 
are implicit in the trade-off obtained with the LQG/LTR design (see 


100 
90 
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107? 107? 107! 10° 10! 10? 10° 
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Figure 6.17 Weightings on S (solid curve) and T (dashed curve). 
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Figure 5.17), and both have a gain close to unity near 10rads~ +. We shall 
have to modify these functions slightly in order to shift the pole of w, (s) away 
from the origin, and to make w,(s) proper; details of the required modifi- 
cations will be given below. The gains of the modified weighting functions are 
shown in Figure 6.17. 


6.8.2 Application of the Glover-Doyle algorithm 


We saw in Section 6.3.1 that we require 


P P W, | =s WG 
P=| _" 2|- 0 | WG 6.215 
is Py, Se Sieesaauts pee 
I! -G 


where G is the transfer function of the plant, so that, in the notation of 
Section 6.7, we have m, = 3, m, = 3, p, = 6 and p, = 3. (We have a ‘2-block’ 
problem, since P,, is square.) If the plant has the realization G: (A, B, C, D), 
and each of the weighting functions has the realization W;: (A;, Bi, Ci, D;) 
(i= 1,2), then it is straightforward to show that P has the realization 
P: (Ap, Bp, Cp, Dp), where 


TA 0.0 0 B 
A=|BC A, 0l, B,=|—B, B,D (6.216) 
[B,C 0 4, 0 B,D 
-D;C -C, 0 D, —D,D 
D,C 0 cC 0 D,D 
C,= 2 2 : D,= 2 
| =C 0 0 I; —D 


The partitions in (6.216) correspond to the partitions between P,,, P12, P21 
and P,,. (This is not the only possible realization of P, of course, but it is the 


easiest to obtain.) 
If each of the scalar weighting functions w,(s) has the realization w;: 


(«;, Bis Yi ôi) then realizations of W; are given by 


A, = diag {a;}, B, = diag { B;} 
C;= diag {y; }, D; = 6;1; 


(6.217) 


where the matrices «;, 8; and y; are repeated three times on each principal 
diagonal, since our plant has three outputs. Clearly w, requires a two-state 
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realization, and we shall have to add two poles to w, in order to make it 
realizable, so it too will require two states. Each of A, and A, is therefore a 
6 x 6 matrix, and, since the plant has five states, A, has dimensions 17 x 17. 
(The other dimensions are: B,eR'’”®, C,eR°“*7 and D,eR®*®,) 
Inserting poles at — 1000 makes a negligible difference to w, over the 
frequency range of interest to us, so in place of (6.214) we shall use 


_ 2000(s + 10)(s + 50) 
wals) = — 57 1000) 


(6.218) 


the gain being chosen to given w, (0) = 1. 

Now we need to consider the various rank conditions which are 
required by the Glover-Doyle algorithm. First of all, we require that rank 
[- (D, DY (D,D) ]" = m, = 3, and rank (Z) = p, = 3. The second of these is 
satisfied, but the first is not, since D = 0 for our plant. We therefore need to 
modify D so that the rank condition is met, but in such a way that the 
behaviour of the plant is changed very little over significant frequencies. Since 
a (G(j100)) % 5 x 1075, then replacing D by 


D = 10751 (6.219) 


should have no perceptible effect on the plant’s behaviour below 100 rads7'. 
We therefore replace D,, as defined in (6.216), by 


D, | -D,D 
D,=| 0 | B,D (6.220) 
I, | -D 


Note that we leave — D unchanged in the lower right corner of D,, since the 
algorithm is actually simplified if P}, is strictly proper (condition (6.183)). 
Rank condition (6.185) becomes, at œ = 0, 


A 0 0 0 
a Sie 4$ 20 6.221 
ran =n = i 
B,C 0 A, 0 Pa ey 
-C 0 0 I 


It is easy to see that this condition is violated, since both A and A, have 
eigenvalues at zero. We therefore use 
A 0 0 
A,=| B,C A, 0 (6.222) 
B,C 0 A, 
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in place of the earlier definition (6.216), and define A and A, as follows. 
Setting A = A, except for A(1, 1) = 4, replaces the eigenvalue at 0 by one at å 
and leaves all the other eigenvalues of A unchanged. Choosing 4 = —0.001 
has very little effect on the behaviour of the plant, except at very low 
frequencies. Examination of the principal gains of both the original plant 
G: (A, B, C, D), and the modified version G: (A, B,C, D) shows that almost no 
difference is perceptible between 0.001 and 100rads~'. 

The origin eigenvalues of A, are shifted by changing w, (s); clearly we 
need to insert a pole at some very low frequency instead of zero. If we wish to 
force three ‘integrators’ into the controller, rather than two (similar to what 
we did in the earlier designs), then this frequency should be lower than |A|, so 
that the controller is forced to inject gain down to this frequency (since the 
plant gain levels out below |4|). We therefore replace (6.213) by 


(s + 6)? 
(s + 0,00006)(s + 0.6) 


w,(s) = (6.223) 


and use the corresponding A 1% 

Rank condition (6.184) is satisfied at œ = 0, even without the changes 
made above. Conditions (6.184) and (6.185) are equivalent to the conditions 
that the systems with realizations (A, B,,C,,D,,) and (A, B, C2, D31 ) (using 
notation as in (6.184) and (6.185)) have no transmission zeros, and no 
pole—zero cancellations, on the imaginary axis. One way of checking this is to 
plot the principal gains of these systems across a wide range of frequencies, 
and see whether any of them approaches zero (since a transfer function loses 
rank at a transmission zero). Note that the first of these systems is not square, 
and so is extremely unlikely to have any transmission zeros (see Chapter 2). 
Such checks reveal that there are no further problems with these rank 
conditions. 

We now have a problem description which can be submitted to the 
Glover—Doyle algorithm. We shall not give a detailed description of the 
computations required, except to note that, since m, = p3, the matrices D,; 4; 
and D,,>, (in (6.182)) do not exist, and therefore (6.199)—(6.201) simplify to 


Dis = —Dy 122; D,» = l, Da, = I, (6.224) 


respectively. 


6.8.3 Adjustment of y and weights 


Before using the algorithm, we need to choose a suitable value of y such 
that || F,(P,K)||,<y. From the LQG/LTR design we know that 
max(||S||,,. || 7.) <4dB, approximately, should be feasible. From 
Figure 6.17 we see that both |w; (j@)| and |w,(ja)| take values close to 3 dB 
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Gain (dB) 


10° 10! 10? 
Frequency (rad s~') 


Figure 6.18 Principal gains of S and T with y=101°. 


near œ = 10, which is where a(S) and o(T) are likely to have their peak 
values. We can therefore estimate that a value in the vicinity of 7 dB, or 2.2, 
should be possible. This estimate may be optimistic since our weighting w, (s) 
is intended to force a faster ‘roll-off’ above œw = 10 than was achieved by the 
LQG/LTR design, and this may force larger peaks of S or T. 

We begin by choosing a very large value of y, 10°. As remarked in 
Section 6.7, the limiting solution of (6.171), as y is increased indefinitely, is in 
fact the ‘LQG’ solution which minimizes || F,(P, K)||2. It is useful to start with 
this solution, if only to ‘debug’ the problem definition. The algorithm yields a 
positive-semidefinite stabilizing X „ (solution of (6.189)), as expected, but, 
rather surprisingly at first, Y,, = 0. In fact this is not unusual for 2-block 
problems, and depends on the locations of the transmission zeros of certain 
subsystems. We take the (17-state) central or maximum-entropy controller 
defined by 6.199 and (6.204—6), and this gives principal gains of S and T as 
shown in Figure 6.18. This already looks like a useful design, with low- and 
high-frequency behaviour much as expected, and the bandwidth about 
right. The peak value of a(S) is 5dB, which occurs at 7rads~!; since 
|w: (J7)| + 5dB, this design certainly gives || F;(P, K) || œ >10dB=3.16. Note 
that o(F,(P,K)) does not reach its peak value at œ =7, however; for 
example at œw=1 we have o(S)=—17dB and |w,(j)| =30dB, so 
| F\(P, K} o = 13dB = 4.47. 

Starting with this initial design, we reduce y until we find the limiting 
value. The evidence so far suggests we should look in the range 1 < y < 10; in 
fact condition (6.195) indicates that we can restrict the search to 2 =< 10, 
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10° 10! 102 


Frequency (rad s~') 


Figure 6.19 Principal gains of S and T with y=3.5. 


since ¢(D,,,2) = V2 for our problem. Since we always obtain the solution 
Y,, = 0, condition (6.196), namely p(X» Y») <y’, is always satisfied. The 
limiting value of y is therefore indicated by (6.189) failing to have a positive- 
semidefinite stabilizing solution. As the limiting value is approached, the 
solution X „ becomes unbounded, and when it is exceeded (y too small) only 
indefinite solutions can be obtained. 

A little iteration ‘by hand’ showed that the smallest value which can be 
achieved is approximately* y = 3.5. With this value of y, the principal gains of 
S and T are as shown in Figure 6.19. The effect of reducing y is to increase the 
bandwidth a little, but || S||,, and || 7'||,, have hardly been affected. This has 
occurred, of course, because (W, S) and ¢(W,T) achieve their peak values 
at frequencies other than those at which o(S) and o(T) reach their peak 
values. 

The results obtained so far indicate that the weights w; (s) and w,(s) 
do not quite reflect our intentions. They have certainly forced a(S) and o(T) 
to reduce rapidly for œ < 10 and œ > 10, respectively, but they have not given 
enough emphasis to the reduction of ||S||,, and |T]. At this stage it is 
necessary to go back to the beginning and adjust the weights to reflect the 
designer’s intentions more accurately. With well-conceived software this need 
not be very time-consuming, but we shall limit ourselves to just one further 
iteration. Figure 6.19 shows that the current weights place more emphasis on 
keeping ¢(7) small than on keeping o(S) small, so if we just increase w; (s) by 
a constant gain, without changing its dynamics, we should be able to decrease 
|| S|, relative to || 7'||,,, and thus perhaps reduce max(||S||, || Tila) 
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Gain (dB) 


20 f feck a {$ 
10° 10! 10? 
Frequency (rad s7 ') 


Figure 6.20 Principal gains of $ and T with w, increased by a factor of 4, and 
y=7.8. 


With W, increased to W; (s) = 4w, (s), the limiting value of y is about 
7.8, and the resulting gains of S and T are as shown in Figure 6.20. Both || S || ,, 
and || T||,, have been reduced significantly, to about 3 and 2 dB, respectively. 
As expected, (S) has been reduced at frequencies lower than 10rads7~}, 
while o(7) has been increased at higher frequencies. Figure 6.21 shows the 
step responses obtained with this design. Note that the steep ‘roll-off’ of ¢(T) 
which has been achieved (compared with previous designs) has resulted in the 
initial slopes of the responses being smaller. This indicates that the control 
signals required are considerably smaller than in the earlier designs. 


6.8.4 Comparison with previous designs 


When comparing the H „ procedure described for this example with the LQG 
procedure used in the design example in Section 5.8, it should be remembered 
that we have described one particular way of using the H „ approach, and one 
particular way of using the LQG approach. Many of the differences which are 
apparent arise from the choice of these particular variants of the methods, 
and not from the use of the H „ rather than the LQG problem formulation. 

The only difference which is a result of choosing the H „ procedure is 
that it is easier to control the peak values of ¢(S) and ¢(T), whereas the LQG 
approach manipulates all the principal gains simultaneously. This is a 
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Time (s) 


Figure 6.21 Closed-loop step responses to step demand on output 1 (solid 
curves), output 2 (dashed curves) and output 3 (dash-dotted curves). 


significant difference, of course, and the prime justification for using the H,, 
approach is the claim that it is precisely these peak values which one usually 
needs to control. 

However, most of the apparent differences between the two design 
examples result from the use of a closed-loop approach here, and an open- 
loop approach in Section 5.8. This accounts for the amount of attention we 
had to devote to the choice of weighting functions, since these really encapsu- 
late all the design decisions. One advantage of the closed-loop approach over 
the LQG/LTR approach is that it makes it easier to control the behaviour 
of a(S) and o(T): we were able to achieve steeper reductions of these— 
particularly of ¢(7)—than in the LQG/LTR design, and to do so with smaller 
peak values of a(S) and o(T). 

A major disadvantage of the closed-loop approach, however, is one 
which has not yet become apparent. If we examine the transmission zeros of 
the controller, or the eigenvalues of the closed-loop system, we discover that 
the controller cancels the plant’s resonant poles at — 0.018 + 0.182); this may 
be unacceptable, as discussed in Chapter 4. As usual, this happens because 
a closed-loop specification has been imposed without regard to the plant’s 
characteristics. Although LQG controllers—such as the one obtained in 
Section 6.8.3 with y = 10°-—generally do introduce such cancellations, the 
LQG/LTR procedure avoided doing so, since the open-loop return ratio 
which was recovered was allowed to exhibit the same resonance as the plant. 
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Undesirable cancellations can be avoided, even with closed-loop approaches, 
by using more elaborate problem formulations which result in restraints on 
the principal gains of transfer functions such as SG (see Section 4.8.2). 
Alternatively, McFarlane and Glover (1988) have recently proposed an H „ 
design procedure which relies on open-loop gain shaping, and therefore offers 
the prospect of avoiding such cancellations. 


6.9 Review and comments 


6.9.1 The Youla parametrization 


The development of the Youla parametrization and its consequences — the 
structure of all stabilizing controllers, for example—has introduced a new 
theory of fundamental importance into the subject of linear-feedback sys- 
tems. It is rather surprising, therefore, to find that the parametrization itself 
was already known in the 1950s, when it was applied in studies of optimal 
control (Raggazini and Franklin, 1958, Chapter 7). Its deeper implications 
went unnoticed at the time, however. It was revived in the 1970s by Kucera 
(1974, p. 116) and Youla et al. (1976), its current name being taken from the 
latter. The general form of the parametrization, as given by Theorem 6.4, 
was developed by Desoer et al. (1980)° with the aid of coprime fractional 
representations. (See also Callier and Desoer (1982).) The state-space inter- 
pretations were developed by Doyle (1984); a crucial ingredient of this work 
was the paper by Nett et al. (1984) which first showed how to obtain state- 
space realizations of fractional representations. 

Whereas Kucera (1974) and Youla et al. (1976) employed the para- 
metrization for quadratic (H,) optimization, Zames (1981) introduced it into 
the solution of H „ problems. 


6.9.2 Alternative approaches to H,, optimal control 


In this chapter we have presented effective algorithms for solving H „ design 
problems, and in order to do this, and to link with earlier chapters, we have 
placed much emphasis on state-space descriptions. One effect of this has been 
to obscure the fact that much of the theory of H,, optimization is best 
understood in the context of complex analysis. That is, it is essentially a 
frequency-domain theory. Most of the new terminology, such as ‘H „’ itself, 
comes from that field (see Rudin (1966), for example), and all the early 
theoretical contributions to the development of the theory (for example 
Zames, 1981; Zames and Francis, 1983; Francis and Zames, 1984; Francis et 
al., 1984) were set in that context. 

Above all, the complex-analysis setting gives an essential basis for the 
development of algorithms which can be applied to systems containing time 
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delays, or which need to be represented by irrational transfer functions. Such 
algorithms are still being developed. 

We shall now outline very briefly how complex analysis comes into 
the feedback design problem. Consider SISO systems first. If the plant has a 
zero at s = z; in the right half-plane, then we know that we cannot attempt to 
cancel that zero by a pole of the controller, if we are to maintain internal 
stability. Thus we must have 


= G(z,)K(z;) os 
Ti) = Gis) Kin) GKE) > 0 (6.225) 


and hence the sensitivity function S(s) = 1 — T(s) must satisfy 
S(z;) = 1 (6.226) 


If we attempt to minimize | S(j@)|, as Zames (1981) did, we must do so subject 
to the constraint (6.226), which acts at each right half-plane zero of the plant. 
We also require S(s) to be analytic in the right half-plane in order to obtain 
closed-loop stability. The problem is therefore equivalent to solving an 
interpolation problem in which the interpolating function is constrained to 
be analytic in a half-plane, as well as being optimal. This is a standard 
problem in complex analysis, known as the Neyanlinna—Pick problem. 
For multivariable systems we have 


det 7(z;) =0 (6.227) 
and hence 
6[S(z;)] 21 (6.228) 


We can proceed by exploiting either the theory of matrix-valued 
Nevanlinna—Pick problems, as have Chang and Pearson (1984), or operator 
theory, as have Francis et al. (1984) and Francis (1987). 

The development given in Sections 6.4 and 6.5 of this chapter follows 
Doyle (1984) and Chu (1985). The link between Sections 6.4 and 6.5, namely 
the equivalence of the H „ problem to the ‘model-matching’ problem, was first 
established by Safonov and Verma (1985). 


SUMMARY 

This chapter has covered a wide range of topics, all of them intimately 

connected with the relatively new H „ theory of feedback design. 
Coprime factorizations of multivariable linear systems have been 

introduced, and methods of obtaining state-space realizations of such 
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factorizations presented. These factorizations have been used to explore 
and exhibit the structure of all stabilizing controllers for a system, and to 
obtain a simple parametrization of all closed-loop transfer functions. 

A derivation of the solution of 1-block H „ problems has been given, 
and the Glover—Doyle algorithm has been presented (without derivation) 
for the solution of general H,, problems. The topic of Hankel approximation 
has been introduced, and it has been shown to have applications in H œ 
optimal control theory and in model approximation. 

A design example has been presented. 

As was the case in Chapter 5, most of the material in this chapter has 
again assumed that the plant can be described by a state-space model with 
finite state dimension. 


Notes 


1. F,(P, K) is called a linear fractional transformation of P and K. The subscript 
stands for ‘lower’, and indicates that K is below P in Figure 6.2. Had we drawn 
the feedback path containing K above P, then we would have needed the 
‘upper’ transformation 


F,(P,K) = Paa + Po, K(I — Py, K) ' Py 


Of course everything here is a function of s. We shall sometimes write 
F,(P(s), K(s)), but we shall usually omit the argument. 


2. We saw in Chapter 3 that this formulation corresponds to minimizing 


Iyi? -Í y? (t)dt 


0 


if the only thing known about the disturbance d is that ||d||, < c, where c is 
some positive number. 


3. From Schur’s formula: 
A B 
det] |- der adero- cam) 
C D 


4. The accuracy with which the limiting value of y can be determined depends on 
the sophistication of both the search strategy and the Riccati-equation solver. 
In this case both were low. The search was performed manually. The Riccati 
equations were solved by using the function ‘LQR’, which is provided with the 
PC-Matlab and Pro-Matlab Control System Toolbox software, with the 
checking for definiteness of the matrices Q and R suppressed. This function is 
not intended for solving Riccati equations with indefinite terms, but it usually 
works. It is likely to fail sooner than a more tailor-made algorithm, however, 
as the limiting value of y is approached. 


5. In works written or co-written by Desoer the parametrization is invariably 
known as the ‘Q parametrization’. 
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6.1 Ifa plant with transfer function G is stable, find an expression for the 
closed-loop transfer function 


T =GK(I+GK)"! 


in terms of the Youla parameter Q. 

If the plant has more inputs than outputs, show that the 
‘perfect tracking’ condition T=J can be approached arbitrarily 
closely over any desired frequency range (for almost all plants). Why 
can it not be achieved exactly? What additional condition is required 
if the number of inputs equals the number of outputs? 


6.2 Figure 6.22 shows the structure of internal-model control (Garcia and 
Morari, 1982). The plant G is assumed to be stable. Show that, if C(s) 
is stable and Go (s) = G(s), then the closed loop is stable. Show also 
that any loop compensator K(s) can be interpreted as having the 
internal-model control structure (assuming that Gp = G and G is 
stable), find the corresponding C(s), and hence deduce that C(s) is the 
same as the Youla parameter Q(s). 


Figure 6.22 Internal-model control. 


6.3 If P is stable in Section 6.4.4, show that one can take 7, = Py,, 
Tiz = Pia and T3; = Pa. 


6.4 Suppose that P (in Section 6.4.4) has fractional representations 


P=N,M;!=M;'N, 


6.5 


6.6 


6.7 
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and that 


N,= Pa a] Ñ,= [z ai] 
Na: N22 Nat N22 
the partitioning corresponding to that of P. Given the realizations 
(6.123) and (6.124) for 7,, and 7,,, show that T,, = Ni2 and 
Th, = T N34. 
(H int: If F is a stabilizing state feedback for the plant P,,, then [0 FT]" 


is a stabilizing state feedback for P if the realization (6.120) is 
assumed.) 


Show that the state-space equations corresponding to Figure 6.15 are 
x| | A BF x a B 
¿| L-ac A+Br+uc]||cl*| Bl” 
w=[(C oC] | ; | 


By defining a new state variable e = x — , so that 


x| JI 0 || x 
ej Ll -I]L¢ 
show that the transfer function from y; to u; is zero. 


If X(s) has realization (A,B,C, D), show that X*(s) = X T(—s) has 
realization (— A‘, — CT, BT, DT). 


Consider the following feedback system: 
Plant: y=G(u+d) 
Controller: u= K(r+y) 


Errors: e= Wu, e,=W,(rt+y) 


It is known that ||r||, < 1 and ||d||, < 1, and it is desired to design K 
so as to minimize 
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Show that this can be formulated as a standard H,, problem with 


o 0 W, 
P, = P., = 
at ea ak 12 EA 


P= [I G], Pan = G 


If G is stable (and W, and W, are chosen to be stable, of course), show 
that the resulting optimization problem reduces to 


k + a 
R2; 
even though P,, is not square. 


(Hint: Show that, in this case, Ry, = R22 = 0. The result actually holds 
if G is unstable too (Chu, 1985).) 


min 
QeH, 


foe] 


Suppose that G has minimal realization G: (A, B, C, D). If N and M 
have realizations 


NÑ: (A+ HC, B + HD,C,D) 
and 
M: (A+ HC, —H,C, —1) 
where 
H= -XC" 
and X is the stabilizing solution of the Riccati equation 
AX + XA™— XC'CX =0 
show that G = M`! Ñ and MM* = I (namely, that M is all-pass). Use 
the following lemma, due to Glover (1984): 


Lemma 6-1: If G(s) has minimal realization G: (A, B,C, D), then G is 
all-pass, namely GG* = I, if 


(1) 
DD =I 


(2) There exist X and Y, such that 
CX+DB™=0 and B'Y+D'™C=0 
where X = X" and Y= Y", 


AX+XA™+BB'=0 and ATY+ YA+C™C=0 


6.9 


6.10 


6.11 


6.12 


6.13 
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Suppose that G has at least as many columns as rows, minimal 
realization (A, B,C, D), and det(DD™) #0. Let N and M have the 
realizations 


Ñ: (A + HC, B + HD,(DD*)~1/2C,(DD™)- "2? D) 
and 
M: (A + HC, H,(DD")~ "2C, (DDT) 1/2) 
where 
H=- XC" 
and X is the stabilizing solution of the Riccati equation 
AX + XA" — (XC! + BD")(DD1)-!(CX + DB") + BB" = 0 


Show that G = M`! Ñ and ÑN* =] (namely, that Ñ is all-pass). 
(Hint: Use Lemma 6.1 above. Note that a suitable solution X exists 
only if G(jæ)GT(—jæ) > 0 for all œ.) 


Work through the Glover—Doyle algorithm of Section 6.7 assuming 
that D,, = 0, and observe the consequent simplification. 


Show that, if 
a 0 0 
P=P4 f A 


K =K(I+D,,K)"! 


and 


then 
F (P, K) = F,(P, K) 


(that is, show that (6.210) holds). ` 
Find a state-space realization of K, given a realization of K. 


Show that the Hankel norm ||G ||, is an input/ output invariant, namely 
that it remains unaltered if a different realization (TAT~1, TB, 
CT™!, D) is taken for G. 


Let (A, B, C, D) be a minimal realization of GeH oo; and let P = PT and 
Q = Q" be the solutions of the Lyapunov equations 


AP + PA™ + BBT =0 
ATQ+QA4+C'C =0 
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6.14 


6.15 


6.16 


6.17 


6.18 
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It can be shown that P > 0 and Q>0. 
Let Q = R™R. Show that there exists U such that U"U = I and 


RPR" = UX? U" 
where E = diag {o1,02,...,0,},0, 20.2... 20,>0. 


Hence show that the realization G: (TAT~+,TB,CT',D), 
where 


T=2~'? UHR 


is balanced in the sense defined in Section 6.6.2. 


(a) Solve the robust stabilization problem defined in Section 6.3.1, 
using the general H,, problem formulation. Assume that 
r(s) = constant, and G is square. Show that this requires the 
solution of Riccati equations of the form 
ATX + XA—XBB'X =0 
(b) Obtain the solution if the plant is stable, and interpret it. 
(c) Obtain the solution if the plant transfer function is 


0.1 s—10 


G(s) = 
(s) s—0.1 + 5410 


and 
r(s) = 0.25 


Repeat Exercise 5.3, using the H o approach. 
Repeat Exercise 5.4, using the H,, approach. 
Repeat Exercise 5.5, using the H o approach. 


In the design example of Section 6.8, controller poles occur at 
—6 x 1075, instead of the origin. Consider methods of obtaining a 
state-space realization of the controller in which these poles have been 
shifted to the origin. 

(Note: Section 8.3.6 shows how to obtain a decomposition K = 
K_+XK,, in which all the poles of K- lie to the left of some real 
number ¢, and all the poles of K, lie to the right of e.) 

In practice, would it be necessary to have the controller poles exactly 
at the origin, or could they be left at —6 x 1075? 


re 
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CHAPTER 7 
Design by Parameter 
Optimization 


7.1 Introduction 7.5 Conclusion 
7.2 Edmunds’ algorithm Summary 
7.3 The method of inequalities Exercises 
7.4 Multi-objective optimization References 


7.1 Introduction 


It frequently happens that the structure of a controller is constrained in some 
way. In the process industries, for example, there are considerable costs 
associated with each non-zero element of the controller’s transfer-function 
matrix. Every such element represents a considerable amount of hardware, 
possibly extensive additional wiring, additional testing and maintenance 
procedures, and additional training for the plant’s operating personnel. There 
are therefore strong pressures to keep the number of ‘cross-couplings’ as low 
as possible. In aerospace applications, considerations of reliability may 
dictate an analog realization of the controller, with the smallest possible 
number of states. 

Such constraints are not respected by any of the design techniques we 
have described in earlier chapters, except for the direct and inverse Nyquist- 
array techniques when driven entirely manually (so that the designer has 
explicit control over the controller’s structure), and the QFT approach. The 
designer using the approximate commutative controller approach can con- 
trol the McMillan degree of the controller, but not the distribution of the 
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dynamics over the controller’s elements: in general, each element of the 
controller’s transfer function will have the same complexity. With the LQG 
and H„ approaches he loses control of the McMillan degree as well, and 
cannot enforce the open-loop stability of the controller. He must therefore 
either split the design problem into sub-problems which do not have struc- 
ture constraints, and apply these design techniques to the sub-problems, or 
abandon the theoretically based techniques and adopt an ad hoc approach. 

In this chapter we shall outline three such ad hoc approaches to the 
problem. They all consist of choosing the controller structure somehow, and 
then optimizing the free parameters within that structure. By modifying the 
problem formulation, the method of choosing the structure and the opti- 
mization algorithm, one can invent an endless variety of ‘design techniques’. 
The three we have chosen to present have been selected because they combine 
simplicity with reasonable effectiveness. 


7.2 Edmunds’ algorithm 


Edmunds (1979) proposed an algorithm for optimizing controller parameters, 
to make the closed-loop transfer function T = GK(I +GK)~' approach a 
target transfer function as closely as possible over a specified frequency range. 
If a quadratic error function is minimized pointwise over frequency, and some 
approximations are made, the problem can be posed as a linear least-squares 
problem to which a standard solution is known. This makes it straight- 
forward to code the algorithm. Another attraction (in the context of this 
book) is that the resulting design technique is a frequency-domain one and 
can therefore be used particularly naturally in conjunction with the Nyquist- 
like techniques discussed in Chapter 4. 

However, the algorithm is rather crude for the task at hand and needs 
to be given a sensible target to aim for, together with a compatible controller 
structure, if it is to succeed. It is particularly suitable, therefore, for ‘tuning’ 
parameter values which have already been obtained by some other method. 


7.2.1 The algorithm 


We now describe the algorithm. Suppose, as usual, that our plant is represen- 
ted by the transfer function matrix G, with | columns (inputs) and m rows 
(outputs), and that we are to design a controller with transfer-function matrix 
K (with m columns and | rows). Let the closed-loop transfer function actually 
achieved by a controller K be T= GK(I+GK)~', and let the ‘target’ 
transfer function which we would like to achieve be T,. Corresponding to 7, 
there is a ‘target’ controller K, such that 


GK, = T(I-1)’ (7.1) 
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We define an error function 

E=T,—T (7.2) 
Then it can be shown, with a little manipulation, that 

(I —T)X(GK,— GKI -T,) = E (7.3) 


If we suppose that ||E|| is sufficiently small, which will be the case if K is 
sufficiently close to K,, then, by replacing I— T by I— T, in (7.3), we obtain 


M- T,)(GK,—GK)I—T,) = E (7.4) 
since 


(I — T)(GK, — GK)(I— T,) = (I — T,) (GK, — GK)(I — T,) + O( ||?) 


(7.5) 
Now let us write 
1 

K(s) = PTs (s) (7.6) 


where d(s) is a common-denominator polynomial which is assumed to be 
known, and N(s) is a matrix of polynomials of known degrees but with 
unknown coefficients. Finally, we define 


B(s) = I — T(s) (7.7) 

A(s) = 76 2160 (7.8) 
and 

Y(s) = B(s)G(s)K,(s)B(s) (7.9) 


Then (7.4) becomes 
Y(s) = A(s)N(s)B(s) + E(s) (7.10) 


The noteworthy features here are that the unknown coefficients in N(s) 
appear linearly in this expression, that A(s), B(s) and Y (s) are all known and 
can be evaluated at particular values of s when required and, hence, that the 
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problem of finding N(s) which minimizes 


IEI% = | tr[E™(—ja)E(jo)]de 
is a linear least-squares problem if the approximate equality in (7.10) is 
replaced by exact equality. 

To put (7.10) into the more familiar standard form in which linear 
least-squares problems are usually seen, we need to ‘stack’ the columns of Y, 
N and E on top of each other. For this purpose we define their columns by 


¥(s) = [y1 (5) - - - yn (5)1 | (7.11) 
N(s) = [n ($) . . - Mn(s)] (7.12) 
E(s) = [e4 (s) - . . en (s)] (7.13) 


We also need to use the ® notation for the Kronecker or tensor product of 
two matrices: if P has p rows and q columns, and Q has r rows and s columns, 
then P Q Q is the pr x qs matrix: 


PQ PQ ..- PisQ 


pagel” Baa ove Paik (7.14) 
PO 20 “os Y, 
In this notation, (7.10) can be rewritten as 
yi(s) n,(s) e,(s) 
28) | rare ae | | + | 2 (7.15) 
Val) nal) enl) 
Remember that n;(s) represents a vector of polynomials: 
ni(s) = [n;(S) . . . m(s)]* (7.16) 
and suppose that 
n,j(s) = vos? + vys? + o. +i (7.17) 


for some positive integer p, assuming for notational convenience that each n,; 
has the same degree. (But this is not a real restriction, since vj, = 0 is allowed, 
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and can be forced if desired.) Then {v3} is the set of controller parameters to 
be optimized; if each n;; has degree p, there are Im(p + 1) of them. We need to 
introduce one more new notation: let D(s) be the matrix (with Im rows and 
Im(p + 1) columns) 


s? sP~1 | 0 
s? ght 1 
2(s)= 0 
s? OPS ta 
(7.18) 
then 
n,(s) 
= X(s)v (7.19) 
Am (S) 
where 
v= te v] g (7.20) 
So if we let 
X (s) = [B"(s) ® A(s)] Z(s) (7.21) 
n(s) = [yi(s)... yR(s)]" (7.22) 
and 
e(s) = [et(s)... eX(s)]7 (7.23) 
then (7.15) becomes 
n(s) = X(s)v + e(s) (7.24) 


which is in a standard form: n(s) is a known vector, X(s) is a known matrix, v 
is a vector of unknown parameters and é(s) is a vector of ‘errors’. 

To obtain a practical algorithm, we need to evaluate y(s) and X(s) ata 
number of points on the imaginary axis, say {s =jo,;;i=1,2,..., u}, and 
approximate ||E||, (which is the same as lella) by 


lel? = § sT(-jo)el jo) (7.25) 


i=l 


It 
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Assembling data from all these points, we obtain 


(jæ) X (jo) e(jo) 
: N : v+ : (7.26) 


n(jo,) X(jo,) e(jo) 


The standard least-squares solution to this would be (Lawson and Hanson, 
1974) 


X(jæ;) a 
v= 4 [X™(—jo,)... X"(—jo,)] 
X(jo,) 
n(jo1) 
x4 [X™(—jo,)... X"(—jo,)] : (7.27) 
n(jo,) 


but in general this would give complex parameter values. We therefore have 
to depart slightly from the standard problem, in order to obtain real 
parameters. To do this, we use the following lemma: 


Lemma 7.1: If Y= X0+E, the value of @ which minimizes || E||z, 
given X and Y, and subject to the constraint Im {6} = 0, is 


6 = [Re{X"#X}] 1 Re {X"Y} (7.28) 


With the aid of this lemma we obtain the optimal real parameters: 


X(jo;) 
=| Res [X1(—jo,)...X"(—-jo,)] 
X(jo,) 
n(j@1) 
x Re 4 [X1(—ja,)... X™(—jo,)] (7.29) 
n(j@,) 


The validity of this algorithm depends on the validity of (7.4), which in turn 
depends on the size of || E||, obtained with the parameter vector ¥. There are, 
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however, ways of extending its validity. Let the controller obtained from 
(7.29) be K;, and suppose that we somehow obtain another controller 


Ki+1 = K,;+(AK),; (7.30) 


Let the closed-loop transfer functions obtained with these be T; and T;,,, 
respectively, and let 


E,=T,-T, (7.31) 
Then, from (7.3), 

(I — T,)G(K, — K,)(I—T,) = E; (7.32) 
and 

U— Ti41)G(K,—K,4,)U-T) = Eip: (7.33) 
If we assume that 7;,, = T,, then subtracting (7.33) from (7.32) gives 

(1 — TGK] — T) © E,— E (7.34) 

Everything in (7.34) is known, except (AK )\ and E,,,, and (7.34) can 
be put into the same form as (7.10), which again leads to a linear least-squares 
problem (assuming that only the numerator elements of (AK); are unknown). 
Thus (7.34) can be solved to find the (AK); which approximately minimizes 
|E;+1l2, in the same way as before. This clearly leads to an iterative 
algorithm since (7.34) can be repeated as many times as required. 

Edmunds (1979), who suggested this approach, uses 

U- 7)G(AK), E (I-T) = E;— E, (7.35) 
where 


5, = 1+(1—T,)G(K,—K,) (7.36) 


The approximation (7.35) is less crude than (7.34). 
A further refinement, which is advisable, is to replace (7.30) by 


K; a K; + a,(AK); (7.37) 


where a; is a scalar. The reason is that, although a reduced error can probably 
be obtained by changing K; in the ‘direction’ of (AK);, the size of the step 
dictated by (7.30) is almost certainly inappropriate because of the approxi- 
mations involved in (7.34) or (7.35). One can either repeatedly halve «; 
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(starting with «; = 1) until || E;+1 || < /E,||, or perform a more sophisticated 
line search to find the value of «, which minimizes ||E; +; ||- 


7.2.2 Comments on Edmunds’ algorithm 


The main advantage of Edmunds’ algorithm is its flexibility. Selected 
elements of the controller K, and even particular coefficients, can be 
constrained to be zero: corresponding elements of v and columns of X(s) 
are simply omitted in (7.24). It is also straightforward to emphasize the 
attainment of certain elements of T,, at the expense of others, by minimizing 
the weighted error 


lel = $, (jo) Welo) (7.38) 


where W = diag {w,} (wj>0, j=1,2,... , Im), instead of (7.25). This again is 
a standard problem, with a standard solution: 


Lemma 7.2: If Y=X0+E, the value of @ which minimizes 
\Elly = EX WE, given X, Y and W>0, subject to the constraint 
Im {6} = 0, is 

6 = [Re {X#WX}] Re {X#WY} (7.39) 


This leads to the replacement of (7.29) by 


WX (jo) tie 
p=| Re4 [X1(—ja,)... X"(-jo,)] 
Wn(jo1) 
x Re 4 [XT(—ja,)... X"(—Jjo,)] : (7.40) 
Wn(jo,) 


The particular value of this possibility is that one usually has little idea of 
how to choose the off-diagonal elements of T,. A frequently adopted strategy, 
therefore, is to set them all to zero, and to attach very small weight to 
achieving them. This allows an initial controller design to be obtained. The 
weights on the off-diagonal terms can then be increased, if it is important to 
reduce the interaction (off-diagonal) terms in T. 

It is immediately apparent from (7.38) and (7.40) that one can apply 
different weights at different frequencies, which can be useful on occasions. In 
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particular, it is possible for the designed controller to give an unstable closed 
loop, even if T is very close to T,. This is often because the low-frequency 
behaviour of GK is very different from that of GK, (but not affecting T much 
if o(GK) is large), perhaps to the extent of having inappropriate signs of zero- 
frequency gains in K, which leads to violation of the generalized Nyquist 
stability theorem. Such a problem may sometimes be corrected by placing 
more weight on the lower frequencies in (7.40). 

A further advantage of the algorithm is that, although we developed it 
by writing K (s) with a common denominator (see equation (7.6)), it is possible 
to assign a different denominator polynomial to each element of K. 

If T, is block-diagonal, then B (in equation (7.7)), and hence B™ @ A (in 
equation (7.21)), is also block-diagonal, and the problem decomposes into 
several smaller sub-problems. If T, is actually diagonal (which is usually the 
case), then each column of N(s) can be optimized independently, and we have 
m separate problems in each of which there are I(p+1) parameters to be 
optimized. 

It is imperative that the actual algorithm employed solves (7.26) by 
using a numerically stable procedure and does not use (7.29), since in the 
neighbourhood of the true solution the matrix 


X (jæ) 
Re- [XT(—jo,) se X"(—jo,)] 
X(ja,) 


approaches singularity. A numerically stable algorithm is obtained as follows. 
In the notation of Lemma 7.1, let X = Xret+jXims Y = Yke +j Yim, then 6, 
defined by (7.28), is also obtained as the least-squares solution of the equation 


HE 

Yim Xim Em 

This can be solved in a numerically stable way by using the algorithm 
described in Section 8.2.3. There is also scope for using recursive techniques 
to update the solution each time data X (jo,) and y(ja,) are calculated at a 
new frequency œp. 

As is usually the case with non-linear least-squares optimization 
(which is what Edmunds’ algorithm is really doing), success depends on 
having a sufficiently good initial design Ko to use in (7.37). If the design 
obtained from (7.29) is not good enough, it may be necessary to obtain an 
initial design by some other technique, omit (7.29), and repeatedly solve (7.34) 
and (7.37). Of course, if T, is too incompatible with the properties of G and the 
assumed structure of K, then no reasonable solution exists at all. One should 
therefore ensure that the ‘roll-off’ rates of T, are compatible with those of G, 
that any right half-plane zeros of G are preserved in T,, and so on. 
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A difficult task facing the designer who adopts this approach to 
parameter optimization is that he must choose the controller’s denominator 
polynomials. It is possible to optimize the denominator polynomials as well, 
but it is then no longer possible to approximate the dependence of the fitting 
errors on the controller parameters by a linear relationship, and a consider- 
ably more complicated algorithm results. This possibility has been explored 
by Hung and MacFarlane (1982) in connection with a design based on the 
reversed-frame normalization technique (see Chapter 4). 


7.2.3 A refinement of the algorithm 


A refined way of using Edmunds’ algorithm has been developed by Nett 
(1988). It is designed to overcome the difficulty of specifying a target closed- 
loop transfer function T, which is compatible with the plant — in particular 
when this has right half-plane zeros — and to make it more likely that a 
stabilizing controller is obtained. These improvements to Edmunds’ al- 
gorithm are obtained by combining it with the Youla parametrization, 
described in Chapter 6. 

Suppose that the plant G is asymptotically stable and has at 
least as many inputs as outputs. The closed-loop transfer function 
T= GK(I+ GK)’ can then be written as 


T(s) = G(s)Q(s) (7.41) 
where Q(s) is the (proper, stable) Youla parameter. Let G be factored as 
G(s) = A(s)Go(s) (7.42) 


in which A(s) is a stable and all-pass transfer function whose zeros are the 
right half-plane zeros of G(s), and G,(s) is a square, stable, minimum-phase 
transfer function. That is, we have 


A(s)A*(s) = 1 (7.43) 


(where A*(s) = AT(—s)), and Go ‘(s) has all its poles in the open left half- 
plane. Now consider 


O,(s) = Go (ATO Tal) (7.44) 


where T,(s) is an ideal desired closed-loop transfer function, At denotes the 
pseudo-inverse of A, and 


A(0)At(0) = I (7.45) 


Of course, T,(s) is chosen to be stable, so (7.44) defines a particular Youla 
parameter, provided the high-frequency ‘roll-off’ rates of T,(s) are chosen to 
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be large enough to make Q,(s) proper. This choice of the Youla parameter 
gives the closed-loop transfer function 


T,(s) = G(s)Q,(s) (7.46) 
= A(s)A* (0) Ty(s) (7.47) 


Although this is different from the original desired transfer function T,(s), it 
has exactly the same principal gains, since A(s) is all-pass. That is, we have 


o;LT,(Jo)] = alo] (7.48) 


so that specified robustness properties are preserved at the plant output. Also, 
T,(s) is achievable with the given plant since the required right half-plane 
zeros are built into it, as can be seen from (7.47). The presence of the factor 
A? (0) ensures that 


T,(0) = T,(0) (7.49) 


so that steady-state characteristics (most commonly, 7,(0) = I) are preserved. 

Now T,(s) is used as the target closed-loop transfer function in 
Edmunds’ algorithm, in place of T}(s). The corresponding target controller is 
given by 


K,(s) = Q,(s)LI — G(s)Q,(s)]~* (7.50) 
= Go *(s)A*(0) Ty(s)[I — A(s) A‘ (0) Ta(s)]* (7.51) 


The point of using Edmunds’ algorithm, of course, is to achieve a 
closed-loop response which is close to 7,(s), while using a controller which is 
significantly simpler than K,(s). Since K,(j@) can be evaluated at any set of 
frequencies, a useful procedure is to display the elements of K,(j@) in 
Nyquist or Bode form and to determine the dynamic complexity of each 
element of the controller K(s), and its pole positions, using these displays as a 
guide. Edmunds’ algorithm should then be able to tune the numerator 
coefficients of K(s) to obtain a good approximation to K,(s). Note that this 
approach retains the freedom to give each element of K (s) a different dynamic 
structure, and to constrain certain elements of K(s) to be zero. It also gives the 
designer the opportunity to use his knowledge of the plant, instrumentation 
and actuators to make sensible compromises between what is demanded by 
K,(s) and what can be achieved in practice. 

In order to implement this refinement of Edmunds’ algorithm, it is 
necessary to have a way of finding the factorization (7.42). The central idea of 
an algorithm for doing this can be found in Exercise 6.9, which gives a way of 
finding state-space realizations for coprime factors M and N of G, such that 
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G = M~'N and NN* = I. Here we need to obtain coprime factors M and N, 
such that G= NM! and NN* =], for then we can take A(s) = N (s) and 
G(s) = M ~+(s), since both M (s) and M ~ ‘(s) are stable (recall that we assume 
G to be stable). 

Suppose now that G is square and has a minimal realization (A, B, C, 
D). A dual procedure to that given in Exercise 6.9 will then give coprime 
factors M and N, such that NN*=TJ. Specifically, these factors have 
realizations 


M: (A + BF, B(D™D)~1?, F, (DTD) +°) (7.52) 
and 

N: (A+ BF, B(D™D)~ 1/2, C+ DF, D(D1D) +°) (7.53) 
where 

F =—(D'D)'(B'X + D'C) (7.54) 


A + BF has all its eigenvalues in the open left half-plane, and X is a solution 
of the Riccati equation 


ATX + XA — (XB + C7D) (DTD) (BTX + D™C)+C'C =0 
(7.55) 


For these realizations to exist D must be non-singular, which will often not be 
the case. But it is usually possible to make a small modification to D which 
renders it non-singular, but which makes very little difference to the plant’s 
behaviour over the range of frequencies of interest. 

An approach to feedback design which bears some resemblance to 
Edmunds’ algorithm has been proposed recently by Boyd et al. (1988). Since 
any closed-loop transfer function is linear in the Youla parameter (see Section 
6.4.4), it is also linear in the numerator elements of this parameter. So if the 
denominator elements of the Youla parameter are fixed somehow, and the 
closed-loop transfer function is ‘tuned’ in the least-squares sense, then a linear 
least-squares problem again results. One can now guarantee that the con- 
troller obtained is stabilizing, since the poles of the Youla parameter can be 
chosen to lie in the left half-plane. But the transformation from the Youla 
parameter to the corresponding controller is a complex one, and it is not easy 
to impose any desired structure on the controller by using this approach. 


7.2.4 Design example 


To illustrate the use of Edmunds’ algorithm we shall apply it to the AIRC 
model used in earlier chapters and described in the Appendix. We shall aim to 
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achieve the target closed-loop transfer function 


l 3 \2/3 \/ 0 
AG) aiag| (25) (5) (55) } (759) 


This specification calls for a faster response of the pitch angle than of the 
altitude or forward speed, and is more realistic than our earlier requirement 
of a bandwidth of 10 rad s~ + in response to all reference signals. The diagonal 
structure of T,(s) implies a complete lack of interaction between outputs. 
The target closed-loop behaviour 7;(s) corresponds to the controller 


K,(s) = GTO — T] (7.57) 


and the frequency response of this is displayed, in Bode magnitude form, in 
Figure 7.1. This shows that quite simple controller dynamics are sufficient to 
achieve the target closed-loop behaviour. 

Integrators are required in columns 2 and 3 of K(s), namely on the 
speed and pitch-angle errors, but not in column 1, namely on the altitude 
error. From Figure 7.1 the pole positions required for the various elements of 
K(s) are as follows: 


(1, 1): —8 (1, 2): 0, —4 (1, 3): 0, —30 

(2, 1): —6 (2, 2): 0, —6 (2, 3): 0, —30 

(3, 1): —6 (3, 2): 0, —6 (3, 3): 0, —30 
Placing the pole of k, ,(s) at —6, and the poles of k,,(s) at O and —6, assigns 
the same set of poles to each element in any one column of K(s) and thus 


allows a simpler (five-state) realization to be obtained. 
The following controller structure was therefore chosen: 


vo s+ 
k.: =t" u 
i1 (S) aay (7.58) 
vh s? + vis + v2 
k. ae i2 i2 i2 
i2(s) =" err (7.59) 
vos? + vis + v2 
k. = i3 i3 i3 
i3(S) aaa ree a (7.60) 


Applying Edmunds’ algorithm (modified to allow distinct denominators) to 
this, with equal weight placed on the achievement of each element of 7,(jq) at 
each frequency œ;, and with a set of 50 frequencies spaced logarithmically 
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EDMUNDS’ ALGORITHM 


1073 1077 10-* 10° 10' 10? 


Frequency (rads~ +) 


(a) 


1073 107? 107! 10° 10! 10? 
Frequency (rad s~ +) 
(b) 
Figure 7.2 Gain of (a) 7,,(j@) or 7,,(jo), and (b) T,3G@). 
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Table 7.1 Numerator coefficients for tuned controller. 


Column 
Row 1 2 3 
v? =8x 10716 = v3 = — 5.2967 
1 v? =—6.5183  vl,=6x107!5 vj, =6.5509 
vi =—4.1806  v2,=1.9101 v2,=77.930 
v3, =9x 10716 v$a = — 0.6153 
2 y2,=—0.7822  v},=9.0000 vl, =0.6702 
y}, =0.1328 v2, = 0.7134 v2, = 22.989 
y9,=—1x 10715 v9,= —99.88 
3 v2 = —17.300  vi,=9x107'S = ¥33= — 62.41 
vi, =—5.6199 v3. = 5.3316 v2, = 104.81 


between 0.001 and 100rads~‘, yields the numerator coefficients given in 
Table 7.1. A noteworthy feature of this is that the numerators of k,,(s) and 
k33(s) have no dynamics, and k,,(s) has only one zero. 

This controller achieves the target closed-loop transfer function al- 
most exactly. Figure 7.2 shows the Bode magnitude plots of the diagonal 
elements of the achieved closed-loop transfer function T(s). Figure 7.3 shows 


Gain (dB) 


107? 107? 107! 10° 10! 10? 
Frequency (rads~ *) 


Figure 7.3 Gains of off-diagonal elements of T. 
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Time (s) 


Figure 7.4 Responses to step demand on outputs 1 and 2 (solid curves) and 
output 3 (dashed curves). 


the magnitudes of the off-diagonal elements, and it can be seen that these are 
all very small. Figure 7.4 shows responses to step demands on each of the 
outputs: the response of each output is exactly as specified, and there is 
virtually no interaction. The only problem is that, as in the design examples in 
earlier chapters, the controller introduces zeros to cancel the resonant poles 
at —0.018 +0.182}. 

The use of Edmunds’ algorithm has allowed us to achieve a precise, 
albeit simple specification, using a considerably simpler controller than the 
ones which resulted from the LQG or H,, design techniques. 


7.3 The method of inequalities 


If one observes operators controlling a process plant manually, it becomes 
apparent that in many cases their objective is to hold a number of plant 
variables between permitted bounds, and that they are indifferent to the 
detailed behaviour of the variables, provided they remain within those 


bounds. The design specifications can therefore be written as a set of 
inequalities: 


lel Ses t20, i=1,2,...,m (7.61) 


where each e,(t) is some function of time which represents the behaviour of 
one plant variable. 
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_ Zakian (Zakian and Al-Naib, 1973; Zakian, 1979) was apparently the 
first to suggest that many feedback-control design problems should be posed 
as the satisfaction of a set of inequalities, rather than the minimization of some 
objective function with inequalities acting as side-constraints. The shift of 
emphasis from an objective function to a set of inequalities gives a more 
accurate formal representation of many design problems, and leads to an 
iterative design procedure in which the designer changes the ‘trade-off 
between conflicting constraints by adjusting the inequalities, rather than some 
objective function. This is attractive, because it is usually much easier to 
understand the physical implications of changes in constraining inequalities 
than of changes in an objective function. 

Optimization is still used in Zakian’s approach, as will become clear 
shortly, but the shift of emphasis from minimization to satisfaction means 
that the usual ideas on choice of optimization algorithms have to be revised: 
speed of convergence in the neighbourhood of a minimum becomes much less 
important than the likelihood of finding at least one feasible point — namely 
one at which all the inequalities are satisfied, and the extent of information 
given to the user about causes of failure - which can be used to modify the 
inequalities, thus making success more likely at the next design iteration. 

Inequality constraints arise from physical limitations as well as from 
desired objectives. For example, magnitudes and/or rates of change of control 
signals are usually limited, which results in a set of inequalities of the form 


lu(Ol<ni, liO < v5 (7.62) 


Similar inequalities can arise not from real physical limits, but from the fact 
that one’s model of the plant is valid only for certain ranges of variables. 

If p is a vector of controller parameters which are to be designed, with 
a prespecified controller structure, then inequalities of the form (7.61) or (7.62) 
assume the form 


o(p) <9 (7.63) 


where each @¢,(p) is a functional. For example, to get (7.67) we define 


o(p)= max (le,(t, p)|)— €: (7.64) 


O0<t<a 


where e,(t, p) now denotes the function e,(t) which is obtained with controller 
parameter p. If we wish to impose closed-loop stability as a requirement, we 
can define 


;(p) = max Re {closed-loop poles obtained with p} +€ (7.65) 


for some positive real e. To constrain the value of one of the parameters (to 
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avoid excessive gain) we use a simple function of p: 


OP) = |Djl — Pmax (7.66) 


and so on. 

Very many design objectives and constraints can be put in the form 
(7.63), even though it may not be straightforward to express ġ;(p) mathemat- 
ically. In particular, if we have an ‘envelope constraint’, such as 


a(t) < y(t, p) < b(t) (7.67) 


where a(t) and b(t) are functions of time, we can represent it by (7.63) if we 
define 


$(p) = magmak [a(t) — y(t, p), y(t, p) — b(t)] (7.68) 


This is referred to as an ‘infinite-dimensional constraint’, since it can be 
regarded as the ‘limit’ of the set of inequalities 


altı) < y(t, p) < b(t;) 
: (7.69) 
alty) < y(ty, P) < b(ty) 


as N is increased indefinitely. Frequency-domain constraints can also be 
represented by (7.63). Problems of robust design can be represented similarly: 
suppose that the variable y depends on a plant parameter vector ô as well as 
the controller parameter vector p, and we have a ‘robust performance’ 
requirement that 


a(t) < y(t, p, 6) < b(t), for all ĝe A (7.70) 
Then we define 


ġ:(p)= max max max [a(t)— y(t, p, ô), y(t, p, ô)— b(t)] (7.71) 
deA OXt< a 


Let S; be the set of parameter vectors for which the ith functional 
inequality is satisfied: 


S; = {p: 6(p) < 0} (7.72) 


Then the admissible or feasible set of parameter vectors, for which all the 
inequalities hold, is the intersection 


S=(\5; (7.73) 
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Clearly p is an admissible parameter vector if 


max $;(p) < 0 (7.74) 


which shows that the search for an admissible p can be pursued by opti- 
mization, in particular by solving 


min max o;(p) (7.75) 


Zakian suggests an algorithm for this minimization which he calls the 
moving-boundaries algorithm. This operates as follows. We let p* be the value 
of the parameter vector at the kth step, and define 


10 if ġi(p) <0 
G a otherwise uo) 


St = {p: $p) < Ci} (7.77) 


and 
S* = () SF (7.78) 


S* being the set of parameter vectors for which some but not all of the 

inequality constraints are satisfied; it contains both p* and the admissible set 

S. If we find a new parameter vector p*, such that C! < C% for each i (where 

Č% is defined similarly to C*), then we accept p* as the next value of the 

parameter vector; that is, we set p**! =p". 
We then have 


max bi(p**"*)< max $;(p*) (7.79) 


and 
Sesh? te s* (7.80) 


so that the boundary of the set in which the parameter vector is located has 
been moved towards the admissible set, or, rarely, has remained unaltered. 
The algorithm is terminated when each C¥=0, at which point we have 
S* = S. This process is illustrated in Figure 7.5. 

If one or more of the C* persists in remaining positive, this may be 
taken as an indication that the set of design objectives is inconsistent, while 
their magnitude gives some measure of how closely it is possible to approach 
the objectives. 

The satisfaction of infinite-dimensional constraints can be tested only 
approximately, of course, by a discretization such as (7.69), except in those 
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Figure 7.5 The ‘moving-boundaries’ process. 


rare cases in which an exact value of ¢,(p) can be computed. (See Zakian 
(1986) for an example of such a case.) 

We have not yet mentioned the difficult part of the algorithm, which is 
the generation of a trial parameter vector p*, given p". Zakian suggests the use 
of an algorithm introduced by Rosenbrock (1960), and reports it to be 
successful, but this has to be considered a rather arbitrary choice; one could 
use any of a number of hill-climbing algorithms in its place. In particular, the 
use of relatively crude direct search methods, such as the simplex method of 
Nelder and Mead (1965) or one of its later developments (see Swann, 1974, or 
Gill et al., 1981)! is worth investigating, particularly in view of the un- 
importance of speed of convergence previously mentioned in this section. 
Direct search methods have been used successfully by Ng (1988). 

Alternatively, a standard way of dealing with a ‘minimax’ problem 
such as (7.75) is to replace it by the minimization of a smooth objective 
function, with non-linear constraints: 


min q (7.81) 
{pT qt 
subject to (p) <q, i=1,2,...,m. (7.82) 


where the minimization is over a new vector [p" q]", which consists of the 
design parameters p and a scalar g. When formulated in this way, the problem 
can be solved by standard algorithms available in the major libraries of 
optimization software. (See Gill et al. (1981) for more details.) 

At the opposite extreme from direct search methods are the sophisti- 
cated algorithms for ‘semi-infinite’ optimization developed by Mayne, Polak 
and their colleagues for solving a wide range of engineering design problems, 
including those posed by Zakian (Becker et al., 1979; Mayne et al., 1982; 
Polak et al., 1984). These algorithms have more theoretical basis than the 
moving-boundaries method, and hence one can say something about their 
properties. For example, it can be proved that an admissible parameter 
vector p will be found in a finite number of steps, if it exists. However, these 
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algorithms are far more complicated than the moving-boundaries method, 
their implementation being a job for experts only. 


7.4 Multi-objective optimization 


7.4.1 Formulation 


A formulation of the control-system design problem, which has 
close similarities to the method of inequalities, has been proposed by 
Kreisselmeier and Steinhauser (1979, 1983). Instead of a set of inequalities, 
they translate a specific design problem into a set of performance objectives 
{J;(p), i = 1,2, ..., m)}, which are chosen such that if J,(p,) < J;(p,) then 
the design parameter (vector) p; is better than the design parameter p,, as far 
as the objective J; is concerned. This set of objectives can be considered to be 
a vector of objective functions, or a multi-objective function: 


J, (p) 
J(p)=|: (7.83) 
Jin( D) 


The performance specification is satisfied if J(p) < c for some positive 
vector c of ‘thresholds’ (i.e. if J;(p) < c; for each i, and c; > 0). At this stage the 
formulation can be seen to be completely equivalent to that of the method of 
inequalities, so what follows in the rest of this section can be viewed as a 
particular way of solving the problem posed in the previous section. As in the 
Method of Inequalities, the appropriate values of some of the elements of c 
are usually not known in advance, since they depend on qualitative judgements 
about trade-offs between competing objectives. These are dealt with by 
initially being given large values, so that the corresponding objectives play 
little or no role in the initial stage of design. As the design progresses the 
values of these elements of c are reduced until they begin to affect the design. 
The idea is that if this is done in a suitable interactive environment then 
the user can discover what compromises are necessary, and can adjust the 
elements of c accordingly. 


7.4.2 Solution 


The distinctive feature of Kreisselmeier and Steinhauser’s approach is the 
way in which they replace the multi-objective optimization problem 


mie i=1,...,m (7.84) 
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by a smooth conventional optimization problem which, unlike (7.81)—(7.82), 
remains an unconstrained problem. Note that the formulation (7.84) (as 
opposed to a formulation such as min, (J,(p)—c;)) has the effect of normaliz- 
ing each objective by its threshold value, so that not much attention need be 
paid to J,(p) if c; is large, and if (7.84) is replaced by 


min a(p) = max —— Alp) (7.85) 


p i Cj 


In fact, an optimization algorithm used to solve (7.85) deals with one 
objective at a time — the one for which the ratio J,(p)/c; is currently the largest. 

. Now, (7.85) is a non-smooth optimization problem—that is, the 
function «(p) is not differentiable everywhere, but it can be approximated well 
by a smooth problem. Consider the function 


äp) = Tmf È exp| (*2) |} (7.86) 


for some p > 0: &(p) is a differentiable function of the parameter vector p, and 
can be rewritten as 


an Sake win $ ex p| o( 2 - x(n) |} (7.87) 


Now, 

2c) —a(p) <0 (7.88) 
so 

a(p) < a(p) + = (7.89) 
Also, 

a —a(p)=0, for some k (7.90) 
and hence 

a(p) > «(p) + Inf +6], 620 (7.91) 


> a(p) 
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So we have 


a(p) < &(p) < alp) wan (7.92) 


and we see that «(p) is approximated very well by &(p) if p > Inm. 

To summarize, we started with the multi-objective optimization prob- 
lem (7.84), and replaced it by the minimax problem (7.85). Finally, we 
approximate this by the smooth, unconstrained problem 


min %(p) (7.93) 


where &(p) is defined by (7.86). 

This problem can be solved by a standard algorithm for uncon- 
strained optimization. The most appropriate is probably a finite-difference 
quasi-Newton algorithm (Gill et al., 1981), since in most cases it is impossible 
to compute gradient information for &(p). Of course, software must be 
written to evaluate «(p) for each design problem, and such evaluations will 
usually require a considerable amount of computation — for example, simu- 
lation runs with the current parameters. As with the method of inequalities, 
there is no need to find the true minimum of &(p) for a given c; the 
optimization can be terminated as soon as &(p) is sufficiently small. In fact, if 
(7.92) is to ensure a good approximation to «(p), the optimization should be 
terminated while (p) is still considerably larger than (In m)/p. If necessary, 
the elements of c can then be reduced further, and the optimization restarted. 

Although the process of adjusting the threshold vector c is one which 
depends on the designer’s skill and understanding (of the control design 
problem, that is, not of optimization algorithms), one should adopt a reason- 
ably systematic procedure for making the adjustments. One possibility is the 
following. Let pë and c* denote the values of the parameters and the 
thresholds at the kth cycle of the optimization. If J; is to be reduced further in 
the (k+ 1)th step, then set 


a) = J,(p") (7.94) 
If J; is already satisfactory, then set 

Co =a; (7.95) 
This gives a decreasing sequence of thresholds: 


Se So Se (7.96) 


and ensures that a feasible p** + can be found (since p* is already feasible for 
the (k+ 1)th step). This systematic procedure may have to be abandoned 


MULTIOBJECTIVE OPTIMIZATION 349 


QZ 


inable 
YY Yj 


Jz(p) 


3 z 
7 
7 
4 
¢ 
ge Pareto 
optimal 
7\ 45° set 2 


— y 
J, (p) 


Figure 7.6 A Pareto-optimal set with two objective functions. 


when trade-offs need to be made — that is, some element of c* may need to be 
increased in order to allow another objective to be decreased. 

Note that we can make any p feasible by choosing the elements of c to 
be large enough, provided all the objectives are finite. Of course, this way of 
making p feasible is rather artificial and should be avoided if possible, since it 
does not give good initial values of the parameters, and may lead to failure of 
the optimization in the sense that it may be impossible to bring the thresholds 
c; down to acceptable values. It is far better to use any information available 
at the start of the design to choose parameter values which are reasonable, in 
the sense that most of the objectives are nearly met by them (so that most 
threshold values need be increased only slightly), and to increase only a few of 
the thresholds substantially. 

From (7.86) it is clear that reducing any one of the ratios J,(p)/c; and 
leaving the others unchanged results in a reduced value of (p). As a result, 
the optimization problem (7.93) has the desirable property that its solution is, 
approximately, a Pareto optimal solution of the multi-objective problem 
(7.84). This simply means that the solution is such that a reduction in any one 
of the objectives J; can be achieved only by increasing at least one of the 
others. This is illustrated in Figure 7.6 for the case when there are only two 
objectives (m = 2). The figure shows, hatched, the space of possible values of 
J,(p) and J,(p) — the attainable set. Any point in the interior of this set has a 
neighbour for which the values of both J,(p) and J,(p) are reduced. Such a 
neighbouring point is clearly preferable to the original one. But points on the 
boundary of the attainable set do not have this property. Moving from one 
point on the boundary to another requires a trade-off: increase J,(p) and 
decrease J,(p), or vice versa. We should always aim to be on this boundary, 
which in the parlance of multiobjective optimization is called the Pareto 
optimal set. 


350 DESIGN BY PARAMETER OPTIMIZATION 


Changing the thresholds in c has the effect of moving from one 
solution point on the Pareto optimal set to another. For example, in 
Figure 7.6 we obtain the point 1 if c, > c,, the point 2 if c, > c3, and the point 
3 if c, =c,. Of course, these points would be obtained exactly only if the 
optimization algorithm succeeded in finding the global solution to (7.84); in 
practice it may find only a local solution. 


7.4.3 Example 


Kreisselmeier and Steinhauser (1983) describe an application of their 
approach to the flight control of a military aircraft (the F-4C Phantom). The 
aircraft is modelled by a simple three-state model, and has one input — the 
elevator command, and three outputs — the pitch rate, the angle of attack and 
the normal acceleration of the centre of gravity. The states are the pitch rate, 
the angle of attack and the elevator deflection. The model parameters vary 
with the flight condition. 

The controller uses only one of the outputs — the pitch rate — and has 
the structure shown in Figure 7.7. The choice of a reasonable structure is 
crucial to the success of the design technique, and it is important to note that 
the structure of Figure 7.7 was strongly influenced by a familiarity with both 
control theory and the particular model of the aircraft. Essentially, the 
controller consists of three simple lags in parallel. Two of these were intro- 
duced to act as approximate observers for the angle of attack and the elevator 
deflection, and one to provide approximate integral action. Apart from these 
three lags, the controller contains seven gains, one in each path through the 
controller. Thus there are ten parameters to be adjusted altogether. 


Elevator 


Reference command 


Pitch rate 


Figure 7.7 Controller structure for the flight-control example given by 
Kreisselmeier and Steinhauser (1983). 
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Many objectives were rather loosely defined for this problem. It was 
required to design the controller for five different flight conditions, with the 
aircraft parameters and the objectives being different for each condition (but 
the controller parameters were to remain the same under all flight con- 
ditions). There were thus five objective functions defined for the tracking 
behaviour of the pitch rate in response to reference signals, and five for the 
tracking behaviour of the elevator rate. Also, five objectives were defined for 
the behaviour of the pitch rate and elevator rate in response to disturbances. 
Fifteen objectives defined bounds on acceptable closed-loop pole locations, 
and seven defined bounds for acceptable controller gains. This makes forty- 
two objective functions in all! 

We leave the reader to consult Kreisselmeier and Steinhauser’s paper 
for further details of the formulation of the problem, and for an account of the 
resulting design. Suffice it to say here that the design technique was successful 
when applied to this very challenging problem. 


7.5 Conclusion 


The idea that optimization of parameters within a fixed structure should be 
used to design control systems is by no means new. It was applied as early 
as the 1950s, but the state of contemporary computing technology prevented 
its widespread exploitation. For it is only rarely that there is an obvious 
translation of one’s real objectives into a mathematical optimization prob- 
lem. In most cases one’s first formulation of the problem produces un- 
satisfactory, even nonsensical, results, and one has to revise the formulation 
several, perhaps many times before useful results are obtained. This was 
certainly a discouraging prospect in the days when each reformulation 
involved, typically, taking a stack of punched cards to the local computing 
centre, then waiting perhaps 24 hours for the results of a run, finding that one 
or two cards needed changing, and repeating the process until success was 
achieved. 

Today the position is radically different, of course, since one can sit at, 
a terminal or a personal workstation, submit runs using macros with their 
own parameters, and obtain results within a few seconds or minutes, already 
displayed graphically and annotated. Even major changes, such as redefini- 
tions of performance functionals, can often be effected quickly with the use of 
an editor. The prospect of using an iterative design procedure in such an 
environment is not at all daunting, and a notable feature of all three 
approaches we have examined in this chapter is that the formulation of the 
optimization problem is not seen as sacrosanct. Indeed, the user is encour- 
aged to explore possibilities by changing the formulation frequently. 

In the author’s opinion, the major change in control-system design 
over the next few years will be the increasing use of parameter optimization. 
Several factors are likely to produce this change. First, current CPU 
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technology is still capable of several doublings of computing speed. In the 
past few years the speed of commercially available workstations, for example, 
has doubled approximately annually (there have been four doublings during 
the writing of this book). This means that the great majority of optimization 
problems which we may formulate today, even those requiring a huge 
amount of computation in order to evaluate the performance functionals, will 
become solvable in a reasonable amount of time — minutes rather than hours. 
(But it should be emphasized that current, even slightly obsolete machines 
will solve many realistic design problems in only a few minutes or less.) 

Secondly, the development of powerful optimization algorithms, such 
as those due to Mayne, Polak and their colleagues (Becker et al., 1979; Mayne 
et al., 1982; Polak et al., 1984), will undoubtedly continue. This will make 
success with difficult problems more likely, and will enlarge the class of 
allowable performance functionals, as well as having an effect on speed. 

Thirdly, and perhaps most importantly, will be the increasing exploit- 
ation of the modern computing environment. Work is already under way on 
the development of integrated environments which will make it easy for the 
casual or inexperienced user to connect software for simulation and optimi- 
zation and thus formulate a particular design problem, without needing to be 
an expert in either simulation or optimization — and without being an expert 
programmer, either. A relatively unexplored area with much promise is that 
of displaying information to the user which allows him to monitor the 
progress of an optimization exercise. For instance, referring to the example 
discussed in Section 7.4.3, how should one inform the user of progress on the 
optimization of forty-two objectives simultaneously? These questions are 
addressed by Nye and Tits (1986) and Ng (1988). 


SUMMARY 


In this chapter we have presented three relatively straightforward methods 
of optimizing the parameters of a controller for a feedback system, assum- 
ing that its structure has already been chosen. 

Edmunds’ algorithm solves a specific frequency-domain optimization 
problem, using a rather crude algorithm. Nevertheless, when used in- 
telligently it is capable of finding very good controllers of relatively low 
complexity, as the design example showed. 

The method of inequalities and multi-objective optimization are 
both capable of solving very general design problems, and are not inherently 
restricted to the frequency domain. Both can be implemented quite easily, 
using standard optimization subroutines. Their effectiveness depends, how- 
ever, on good user interfaces, and the development of these is an area of 
current research. 


Note 


1. 
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Gill et al. (1981) use the term ‘polytope’ in place of ‘simplex’. 


EXERCISES 


7.1 


7.2 


7.3 


7.4 


7.5 


(a) Show that Edmunds’ algorithm can be used to tune the con- 
troller parameters if a target-sensitivity function S, is specified 
(where S = (I+ GK)~?). 

(b) Show that Edmunds’ algorithm can be used to obtain designs 
which represent a compromise between achieving a specified 
closed-loop transfer function and a specified behaviour of the 
control signals (plant inputs). 

(Hint: Augment the plant with suitable additional outputs.) 


A SISO plant has transfer function 


s—1 
OO) S 


A desired, but unattainable, closed-loop transfer function for it is 


10 
15) = s+10 


Find an attainable closed-loop transfer function T,(s), which has the 
same gain behaviour as T}(s), and the corresponding feedback con- 
troller. Compare the characteristics of the two sensitivity functions 
Sa(s) = 1 — T; (s) and S(s) = 1 — T,(s). 


Note: The following exercises probably require considerable soft- 
ware development. A language such as Matlab should be used to 
keep this within reasonable bounds—for exercises 7.4 and 7.5 use 
software which has easy-to-use optimization facilities available. In a 
class, these exercises are best tackled by teams of two or three people, 
each team solving one exercise. 


Apply Edmunds’ algorithm to solve Exercise 5.3, 5.4 or 5.5. 
Apply the method of inequalities to solve Exercise 5.3, 5.4 or 5.5. 


Apply multi-objective optimization to solve Exercise 5.3, 5.4 or 5.5. 
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CHAPTER 8 
Computer-aided 
Design 


8.1 Introduction 8.4 Software for control 
8.2 Elements of numerical engineering 
algorithms Summary 
8.3 Applications to linear Exercises 
systems References 


8.1 Introduction 


It is apparent that computer assistance is essential if one is to do any 
multivariable analysis or design, and even for SISO problems the use of the 
traditional graphical methods to obtain accurate results can no longer be 
justified (although these methods do have some didactic value). The nature of 
the assistance required can for convenience be divided into two categories: 
numerical algorithms for performing basic tasks such as evaluation of fre- 
quency responses, and higher-level software which makes the numerical 
algorithms available to the user (invariably in an interactive environment), 
allows results to be passed from one algorithm to another, presents results 
graphically when required, and so on. The units in which the higher-level 
software is organized are often referred to as ‘packages’. 

The basic ideas of some numerical algorithms have already been 
described in earlier chapters. In this chapter we shall present some of the 
more important algorithms which are widely used in the analysis and design 
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of control systems, and which have not been described in earlier chapters. In 
particular, we shall describe how to compute frequency responses and 
characteristic loci, how to compute interconnections and inverses of systems, 
how to obtain minimal state-space realizations, how to split a system into its 
stable and unstable parts (and, more generally, into its ‘partial fractions’), 
how to compute the root loci of multivariable systems, how to compute 
transmission zeros and how to compute balanced realizations and low-order 
approximate systems. Finally, we shall have more to say about the solution of 
Riccati equations (in addition to what has already been said in Chapter 5). 
The reader will also find material on algorithms for diagonalizing systems in 
Chapter 4, for computing low-order approximations in Chapter 6, and for 
solving least-squares and other optimization problems in Chapter 7. 

We shall then go on to examine the software actually available for 
control-system design, and how it is likely to develop over the next few years. 

A word of warning is in order. We can only outline the basic ideas of 
each algorithm in this book. Turning these ideas into reliable mathematical 
software requires a major development effort, as the experience of several 
groups, both academic and commercial, has repeatedly shown. 


8.2 Elements of numerical algorithms 


8.2.1 Conditioning and numerical stability 


Since computation with real numbers can be performed with only limited 
precision, and is constrained to use a particular machine’s number system, 
numerical algorithms which operate on real numbers (as opposed to integers) 
almost always give wrong answers, (the only exceptions occurring in (some of) 
those rare instances when both the data and the results are exactly represen- 
table in the machine’s number system). Numerical analysts have therefore 
expended much effort in devising algorithms which are ‘good’ in the sense 
that they do not give unnecessarily large errors, and most of the basic 
algorithms available in modern subroutine libraries, particularly those con- 
cerned with linear algebra, are good in this sense. We cannot say that a good 
algorithm never gives large errors, because even a good algorithm may on 
occasions be used to solve a problem whose solution is-so sensitive to 
specification errors that large errors result, simply from expressing the input 
data with finite precision. Problems which exhibit such sensitivity are said to 
be ill-conditioned. 

While we cannot say that an algorithm never gives large errors, we 
may be able to say that a particular algorithm always gives the exact solution 
of some problem which is ‘close’ to the original problem. Such algorithms are 
called numerically stable or backwards-stable. If we use a numerically stable 
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algorithm on a weil-conditioned problem, then we can be sure that the result 
will be close to the true result (see Exercise 8.1). 

The investigation of the numerical stability of an algorithm is usually 
very complicated, requiring ingenuity as well as attention to a myriad of 
details. But one can pinpoint the common causes of numerical instability: 
subtraction of one large number from another, possibly resulting in a 
completely false result because of limited precision, division by a number 
whose magnitude may be very small (or even zero), and multiplication of 
two small numbers. The unnecessary inversion of matrices is something 
which is particularly to be avoided, since such an inversion may well be ill- 
conditioned; if this occurs inside an algorithm, the algorithm is not likely to 
be numerically stable. An exception to this is the inversion of orthogonal 
matrices (namely those for which T~* = TT), since it can be accomplished 
simply by transposition, without introducing any error. For this reason most 
of the numerically stable algorithms. for linear algebra make much use of 
orthogonal transformations. 

When we analyse feedback systems we frequently require trans- 
formations of complex matrices (usually frequency-response matrices), in 
which case orthogonal matrices are generally replaced by unitary matrices, 
namely those for which T~* = T#. 

Many of the techniques presented in text-books and papers on 
control, which work perfectly well on small problems which can be solved 
exactly by hand, lead to very poor algorithms when used as the basis of 
machine software. Examples are the usual rank tests for controllability and 
observability, and evaluations of exp (At) based on spectral decomposition of 
A. Fortunately much work has been done since about 1975 on developing 
numerically stable algorithms for control applications, and today we can 
perform most of the important computations reliably. The reader is referred 
to van Dooren (1981a, b), Emami-Naeini and van Dooren (1982), Laub 
(1985) and Arnold and Laub (1984), and the references therein, for detailed 
discussions and analyses. 

Successful algorithms in the fields of systems and control depend, for 
the most part, on reliable, numerically stable algorithms for linear algebra. 
We shall now summarize the most important of these, but the reader should 
consult a numerical analysis text such as Golub and van Loan (1983) or 


Stewart (1973) for more details. There is also an excellent summary in Gill 
et al. (1981). 


8.2.2 Solution of Ax = b when A is square 


The exact solution is x = A~*b, but explicit computation of A~! gives a 
numerically unstable algorithm. A standard algorithm, which can be proved 
to be numerically stable, uses Gaussian elimination with a so-called pivoting 
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strategy to obtain the factorization 

PA=LU (8.1) 
in which L is a lower-triangular matrix, U is an upper-triangular matrix, and 
P is a permutation matrix -namely one whose elements are all zero, except 
for exactly one 1 in each column and row. (Numerical analysts call (8.1) an 
LU factorization of A.) Then the equation 


Ly = Pb (8.2) 


is solved for y, which can be done easily by substitution since L is lower- 
triangular, first solving for y,, then for y,, and so on. Finally the equation 


Ux =y (8.3) 
is solved for x, which again can be done easily by back-substitution since U is 
upper-triangular. Substituting (8.3) into (8.2), and noting that PT = P~}, 
gives 

PTLUx =b ' (8.4) 
which, from (8.1), is the same as 

Ax=b (8.5) 
We shall denote the solution of (8.5) obtained using this algorithm by 

x=A\b (8.6) 


For this problem we can estimate the conditioning. In Section 4.10 the 
condition number of A was defined as 
(A) 

o(A) 


cond(A) = (8.7) 


where o(A) and a(A) are the smallest and largest singular values of A, 
respectively. Note that cond(A) > 1. Suppose that b is perturbed to b + ob, 
and that the resulting (exact) solution then becomes x + 6x. It can be shown 
that 


ext cona(ay a (8.8) 


III lb | 
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If A is perturbed to A + 6A (and b remains unperturbed), then 
lêxl || 6A |i 


= d ay ea 8.9 
[etext Nal #2 


Thus the position with solving linear equations is very fortunate: we 
have a numerically stable algorithm, and we can tell whether a particular 
equation is well conditioned or not. (A useful rule of thumb is that the 
machine may lose the last logo cond (A) decimal places of a solution because 
of round-off errors during the Gaussian elimination.) 

A slight variation of (8.5) which arises commonly in control analysis is 
the linear equation 


AX =B (8.10) 


where A is square, of dimension n x n, and X and B are matrices with m 
columns. In this case we simply regard (8.10) as m equations of the form (8.5): 


Ax, = b,, Ax, = b2,..., AXm = bm (8.11) 


and solve each one as before, except that the LU factorization (8.1) need be 
found only once. We shall use the notation 


X=A\B (8.12) 
to denote the solution of (8.10) found by this method. This solution requires 


about n* + mn? floating-point operations (‘flops’). 


8.2.3 Householder transformations and QR factorization 


A Householder transformation is a matrix of the form 


H 
2w, Wy 


=I- 
CAE 


(8.13) 


where w, is any non-zero vector. Such a transformation is unitary, as is easily 
checked. Consider a vector 


aik 


a= l (8.14) 
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and 


(8.15) 


(8.16) 


T,.a,= 


and 
(8.17) 


Ty +1 (Tide) = Tex 
This shows that, by applying a sequence of such Householder trans- 


formations, any n x n matrix A can be reduced to upper-triangular form: 
(8.18) 


Tq digas T,A=R 
where R is upper-triangular. If A is an mxn matrix, m>n, and 


rank (A) = n, then the same process gives 
(8.19) 


R; 
T -1 Oe Ne 
n 1 1 El 


where R, is again upper-triangular, and R, is non-singular. 


Writing 
(8.20) 
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and recalling that Ty | = TH, we have 


A=Q | | (8.21) 


which is known as the QR factorization of A. 

If m >n and rank(A) =r < n, then an additional permutation of A 
may be necessary to obtain a QR factorization, so in this case we have, in 
general, 


AP =QR (8.22) 


where 
R= E i (8.23) 


R,, is anr x r upper-triangular matrix, and P is a permutation matrix; Q is 
unitary, as before. 
We shall write 


LQ, R] = QR(A) (8.24) 


to denote that A = OR, and 


LO, R P] = QR(A) (8.25) 


to denote that AP = QR. 

Most users do not have to concern themselves with the details of 
Householder transformations, since they are contained within subroutines. 
They have been described here because some knowledge of them is necessary 
for understanding the algorithms for partial-fraction decompositions and 
Riccati-equation solutions that will be described later. 

The principal use of the QR factorization is in solving linear least- 
squares problems in which A has more rows than columns, and we wish to 
find the vector x that minimizes || Ax — b||,. If A has full column rank then 
(since ||. ||. is invariant under unitary transformations) 


|| Ax — bil, = |Q"(Ax — b) II, 


ipe 


(8.26) 


2 
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If we write Q = [Q;, Q2], with as many columns in Q, as there are rows in 


R,, then 
Hb 
Ax — bl. = | le a | (8.27) 
2 
which is clearly minimized by 
x= R \QÏb (8.28) 


Since R, is upper-triangular, this is easily computed by back-substitution 
once the factors Q and R have been obtained. 

If the rank of A is smaller than the number of columns, then there are 
either no solutions or infinitely many solutions of the equation Ax = b. The 
QR factorization can be employed to find the solution (if it exists) for which 
||x||, is minimized. Of particular interest is the ‘underdetermined’ problem 
which occurs when A has fewer rows than columns. To solve this we first find 


[Q, R, P] = QR(A’) (8.29) 
Then we have 

IxTAT — b" ||, = |xTQR — b'P |l2 (8.30) 
since PT = P71. Now we find z" such that 

ZT[R;1; R2] = b'P (8.31) 
(recall (8.23)), the solution being found by substitutions since R,, is upper- 


triangular. If R,, is such that no solution for z exists, then Ax = b has 
no solutions. If a solution for z is found, then we have 


IxTAT — BT], = [|x™[O,,Q.IR — 2° [Ri Rial (8.32) 
= (x7 Q, = z")ERi, Rı2]ll2 (8.33) 
=0 
if 
x= Oz (8.34) 


8.2.4 The Hessenberg form of a matrix 


Any square matrix A is similar to a matrix H which is in upper Hessenberg 
form, namely such that h;; = 0 for i — j > 2: 
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hy, hi2 his oie hin 
hy, hy» h23 misri han 

H = 0O hz, hs os : (8.35) 
0 Ea A o ete 


0 hy, n-t hyn 


One can obtain H from A by using a unitary similarity transformation: 
H = TAT” (8.36) 


where T" = T~', and this transformation can be effected in a numerically 
stable way, again by obtaining T as a finite sequence of Householder 
transformations. 

In fact, if we define w, as 


0 
} k 
0 
We =| Ak+1,k Fk (8.37) 
dk + 2,k 
Ank 
with |r,| = ||@, || as before, and define T, as in (8.13), then it is easily checked 
that 
Tn-2--. DLT ,ATÌ T]... TE, HH (8.38) 


We shall use the notation 
LT, H] = Hess(A) 


to mean that H is the Hessenberg form of A, and A = THT", and T#T = I. 


8.2.5 The Schur form of a square matrix 


Every square matrix A is unitarily similar to an upper-triangular matrix 
which is sometimes referred to as the Schur form of A. Since the diagonal 
elements of this matrix are the eigenvalues of A, the Schur form in general has 
complex elements, even if A is a real matrix. 
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If A is real, it is also possible to obtain a real Schur form in which all 
the elements are real, but which is not quite triangular: 2 x 2 blocks appear 
on the diagonal of the real Schur form, and each such block is associated with 
a complex-conjugate pair of eigenvalues. We shall call such a form quasi- 
triangular. 

The Schur form cannot be computed by any finite algorithm, and 
must therefore be obtained approximately by an iterative algorithm. The 
standard means of obtaining the Schur form is the QR algorithm, which in 
essence has the following form: 


i:= 0; A= A; 


repeat 


choose g; somehow; 
[Q,, R;J:= QR(A; — c;l); 
Aj +1:= RQ; + a;l; 
i= f+ 1; 

until A; is in Schur form. 


It is easy to show that each A, is similar to the original A: since 


A; — ol = Q;R; (8.39) 
we have 
R; = QQ} (A; — 9,1) (8.40) 
and hence 
A;+1 = O8(A; — 6,1)0; + a1 (8.41) 
= 94,0, (8.42) 


From this it follows that 
A =Q}: ... Q7 QAQ Q: see Opsi (8.43) 


so similarity is preserved. 

It is not so easy to see why this algorithm should converge to 
anything, let alone to the Schur form, and we refer the reader to Golub and 
van Loan (1983) for a detailed exposition. For our purposes we need to note 
the following facts. If A is first transformed to Hessenberg form, so that 


Ay = H = TAT# 
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(compare equation (8.36)), then each subsequent A; will retain the Hessenberg 
structure. If such an A; has no zeros on its first sub-diagonal, then choosing 
the shift c; to be an eigenvalue of A (A, say) will result in A;,, having the 
structure 


ösi x X XX 
Aas (8.44) 


x X X 


0 0 2 


That is, the eigenvalue will be exhibited in the (n,n) element, and the 
(n,n — 1) element will be zero. Of course, the shift cannot be chosen to be A, 
since we do not know the eigenvalues of A, but it has been found that the 
algorithm will lead to convergence to a matrix of the form (8.44) for quite a 
wide choice of shifts o;. The best choice, which usually leads to the fastest 
convergence, is obtained as follows: find the eigenvalues of the 2 x 2 block in 
the bottom right-hand corner of A,, 


ee Ge 
| y ri | (8.45) 


nn-1 n,n 


and choose o; as the eigenvalue which is closest to aż „. When converg- 
ence has been obtained so that a$, „-; = 0, the algorithm is applied to the 
(n — 1) x (n—1) matrix formed by the first n — 1 rows and columns of 
A,, and the next eigenvalue of A is eventually exhibited in the (n — 1, n — 1) 
element. This is repeated until the whole Schur form has been obtained. 

If A is real, a variant of this algorithm which uses two shifts at each 
iteration, is commonly used to obtain the real Schur form and to avoid 
complex arithmetic altogether. (See Golub and van Loan (1983) for details.) 


Suppose that the (complex or real) Schur form of A is partitioned so 
that 


Sip. 8 
Q"AQ=S= E A (8.46) 


where S,, and S,, are themselves in Schur form, and have no eigenvalues in 
common. If we partition Q conformally, so that Q = [Q, Q], then 


AQi =Q; 5; (8.47) 
The columns of Q, therefore span an invariant subspace associated with the 


eigenvalues of S4, . Since Q is unitary (orthogonal in the case of the real Schur 
form), the columns of Q, form an orthonormal basis for this invariant 
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subspace. This subspace is also spanned by the eigenvectors associated with 
the eigenvalues of S,,; but, whereas eigenvectors are difficult to compute 
reliably, the columns of Q are obtained from the numerically stable QR 
algorithm. For many applications it is enough to find this orthonormal basis 
of an invariant subspace, rather than the eigenvectors themselves. 

We shall use the notation 


[Q, S] = Schur(A) 


to mean that S = Q' AQ, S is in real Schur form, and QTQ = I. 


8.2.6 Eigenvalues 


The standard way of finding eigenvalues is to find the Schur form as described 
in the previous section, and then to read off the diagonal elements. Note that 
there is no particular order to these elements, although there is a tendency for 
the largest eigenvalues (in absolute magnitude) to be in the top left-hand 
corner of the Schur form, and the smallest ones to be in the bottom right- 
hand corner. 

Although the QR algorithm is numerically stable, it is possible for 
eigenvalues to be ill-conditioned. Roughly speaking, two eigenvalues are ill- 
conditioned if they are very close to each other. (However, the remaining 
eigenvalues may be well-conditioned in such a case.) 


8.2.7 Singular-value decomposition 


For every matrix A of size m x n and rank r, one can obtain the singular- 
value decomposition 


A=UxVH (8.48) 


in which U and V are unitary matrices of size m x m and n x n, respectively, 


and 
2, 0 
S| 8.49 
rad (849) 
where X, = diag{o,,0,...,0,}, and c, 2>0,>... >60, > 0. This finds 


applications in analysis and design, as we have already seen in Chapters 3 to 
6, but is also useful as a constituent of several system-theoretic algorithms. 

In particular, one can use the matrices U and V to obtain a row 
compression of A: 


R 
UYA = | | (8.50) 
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or a column compression of A: 
AV=[C 0] (8.51) 


as is apparent from (8.49). Here R has r rows, all of which are linearly 
independent (R has full row rank) and C has r columns, all of which are 
linearly independent (C has full column rank). 

The diagonal elements of ÈE, are called singular values, and the 
columns of U and V are called the left and right singular vectors, respectively. 
The value of ø, gives a measure of how close the matrix A is to a matrix of 
rank r — 1, in the very precise sense that if the matrix A + E has rank r — 1 
(or smaller), then || E ||, > c, and an E exists for which the equality holds. 
Examination of the singular values therefore gives a reliable way of esti- 
mating the rank of a matrix. In practice the singular values are unlikely to be 
exactly zero, but one normally considers them to be zero if their magnitudes 
are comparable to the precision of the machine one is using (about 2 x 107 16 
in implementations of IEEE Standard arithmetic). 

Another reason for the usefulness of the singular-value decomposition 
is that a pseudo-inverse of A is easily obtained from it: 


yo! 
a= ae (8.52) 


This has the properties AAtA = A and AAA? = A’. If r = m=n then we 
have A`!=VE tU". 

The standard way of computing the singular-value decomposition 
makes use of the OR algorithm, but we do not need to know the details (see 
Golub and van Loan, 1983). The algorithm is numerically stable, and the 
singular values are always well-conditioned. Note that, although the singular 
values are the square roots of the non-zero eigenvalues of A"A, good 
algorithms never form this product explicitly, since this could cause an 
unnecessary loss of precision. 


8.2.8 Generalized eigenvalues 


If A and B are square, n x n matrices, and the scalar 4 and the vector x are 
such that 


Ax = ABx (8.53) 


then 4 is a generalized eigenvalue and x a generalized eigenvector of the pair 
(A, B). We shall denote the set of all generalized eigenvalues of (A, B) by 
A(A, B). (Note that 1(A, B) 4 A(B, A).) If rank (B) = n, then A(A, B) has n 
elements, otherwise it may be empty (that is, there may be no solutions), finite 
or infinite. 
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It can be shown that one can obtain a generalized Schur decomposition 
for every pair (A, B): there exist unitary matrices Q and Z such that 


Q¥AZ=T and Q4BZ=S (8.54) 


where T and S are both upper-triangular. Generalized eigenvalues are easily 
obtained from T and S. If s; # 0, then the ith generalized eigenvalue is given 
by 


A,(A, B) = a (8.55) 


ii 


However if, for some i, t; = s;; = 0, then the set of generalized eigenvalues is 
the whole complex plane. (If t; 40, then y;(B, A) = s;;/t,; is the ith gen- 
eralized eigenvalue of the pair (B, A).) A small value of |s,;| in (8.55) indicates 
that 1,(A, B) is ill-conditioned. 

The generalized Schur decomposition of the pair (A, B) can be ob- 
tained by the QZ algorithm, which is a generalization of the QR algorithm, 
and is numerically stable. (For details see Golub and van Loan (1983).) 


8.3 Applications to linear systems 


8.3.1 Frequency-response evaluation 


Evaluation of the frequency response of a system requires the computation of 
the transfer-function matrix G(s) at a number of points on the complex plane. 
If we have a continuous-time model these points are on the upper half of the 
imaginary axis, and for a discrete-time model they lie on the upper half of 
the origin-centred unit circle. (If w-plane rather than z-plane methods are 
employed for discrete-time systems, then the evaluation is again on the 
imaginary axis, after an appropriate transformation of the system model. 
See Franklin and Powell (1980) for details.) 

If the system definition is already in the form of a transfer-function 
matrix, so that each element of the transfer function is already expressed as a 
function of the complex variable s, then the evaluation is, in principle, 
straightforward. Each element is simply evaluated at each of the required 
frequencies. If the system has / inputs and m outputs, and each element is a 
ratio of two polynomials of degree d, then this evaluation requires 4/m(d + 1) 
(complex) floating-point operations at each frequency. If it is known that all 
the denominator polynomials are common, the number of operations can be 
reduced to Im(2d + 1) + 2d. 

A drawback of this straightforward approach is that accurate evalu- 
ation of polynomials of high degree can be surprisingly difficult (Wilkinson, 
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1965). It may therefore be preferable to obtain a state-space realization of the 
transfer-function matrix first, and then apply the algorithm described in the 
following paragraphs. But this is not a foolproof method either, since 
algorithms for obtaining realizations are not numerically stable. Fortunately, 
transfer-function representations are usually employed only for low-order 
models, which do not usually lead to numerical difficulties. 

Most frequently, systems are defined by state-space models rather 
than transfer functions. The obvious way of computing the frequency re- 
sponse is to convert the model to a transfer-function representation first, then 
proceed as above. This is not to be recommended, however, because the only 
reliable way of performing the conversion to transfer-function form involves 
computing the frequency response first! Fortunately one can compute the 
frequency response directly from the state-space model, using an efficient and 
numerically stable algorithm proposed by Laub (1981). 

We are given the matrices A, B, C and D of a state-space model, and 
we recall that the expression for the corresponding transfer function is 


Gis) = C(sI — A)1B+D (8.56) 


We first obtain the Hessenberg form A of A, and the corresponding unitary 
transformation T: 


[ 7, A] = Hess(A) (8.57) 
so that 

A=T*AT (8.58) 
We now define 

B=T#B (8.59) 
and 

C€=CrT (8.60) 
and note that (A, B, Č, D) is an alternative state-space realization of the 
original system. 

Now we compute 

G(s) = C[(sI — A)\B]+D (8.61) 
at the required frequencies, the matrix [(s! — A)\B] being computed (at 


each frequency) as described in Section 8.2.2. The point of the initial trans- 
formation to Hessenberg form is that sI — A is in Hessenberg form at each 
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value of s, and this reduces the number of operations required to compute 
[(s! — A)\B] from about n? + In? to about In?, where n is the number of 
states. If we add the number of operations required to perform the multiplica- 
tion of C and addition of D, we see that we need about In? + Im(2n + 1) 
operations at each frequency. Since a typical requirement is to compute a 
frequency response at 50 frequencies or more, the cost of the transformation 
to Hessenberg form (about 2n? operations), which is performed only once, is 
usually very minor. 


8.3.2 Characteristic loci and principal gains 


The characteristic loci of a square system are just the eigenvalues of the 
frequency response G({ja), and are evaluated at the same set of frequencies as 
the frequency response. This is done by transforming the frequency response 
to Schur form, as described in Section 8.2.5. 

If the characteristic loci are plotted by joining corresponding eigen- 
values at successive frequencies by line segments, the display may show loci 
crossing at points which are clearly inappropriate. This is because ‘corre- 
sponding’ eigenvalues sometimes appear on the diagonal of the Schur form in 
different positions at different frequencies. The effect is illustrated in Figure 
8.1(a) for a pair of characteristic loci. If the continuous branches of the 
eigenvalues are labelled 4, (jœ) and A, (jæ), then Figure 8.1(a) results from the 
eigenvalues appearing in the Schur form in the order (1, , 43) at frequency «,, 
(A,, 4,) at œz, and (A,, 2,) at œw. To obtain the ‘correct’ display, as shown in 
Figure 8.1(b), requires sorting the eigenvalues before displaying them. Such 
sorting requires a significant amount of computation. 

The standard scheme of computing characteristic loci by transforming 
to Schur form makes no attempt to exploit the fact that the eigenvalues are 
continuous functions of frequency. Making use of this information should 
reduce the number of iterations of the QR algorithm required to obtain the 
Schur form, since good initial estimates of the eigenvalues are already 
available from the previous frequency. It should also be possible to ensure 
that eigenvalues are found in an order which corresponds to continuous 
branches, without explicit sorting. The author has experimented with various 
modifications of the QR algorithm (for example, using eigenvalues found at 
the previous frequency as shifts, or using the accumulated unitary trans- 
formations from the previous reduction to Schur form), but so far without 
success. The problem is that, although these modifications occasionally 
produce superior performance, for typical frequency responses the standard 
QR algorithm is very hard to beat: it normally finds the Schur form in only 2m 
or 2m + 1 iterations, where m is the dimension of the transfer-function 
matrix. (This figure may not hold for large m, but it should be borne in mind 
that m is rarely as large as 5, and probably never as large as 10.) The 
published literature apparently contains no contribution to this problem. 
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Im Im 


X Ar (jo) 


A 
X Az (ja) 


Figure 8.1 Segments of two characteristic loci, (a) without sorting into continu- 
ous loci and (b) with sorting. 


Principal gains (singular values of frequency responses) are usually 
computed using the standard algorithm -see Section 8.2.7. This algorithm 
produces them already ordered by magnitude, so there is no problem in 
displaying them on a Bode plot. 


8.3.3 Interconnections of systems 


Operations which are frequently performed on systems are series, parallel and 
feedback connections. 

If the systems are defined by their frequency responses, and the 
frequency response of the connection is adequate for one’s needs, then all that 
is required is the evaluation of the frequency response at each of a set of 
frequencies. For example, the response of the feedback connection of G(s) 
with H(s) (see Figure 8.2) is computed as 


T (jw) = [I + G(jw)H (jo)]\ G(ja) (8.62) 
or 


Tjo) = (LI + (jo) G(jo)]" \ G(jo)")" (8.63) 


Figure 8.2 Negative-feedback connection of two systems G(s) and H(s). 
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Of these, (8.63) is to be preferred if G(s) has fewer inputs than outputs, since 
the dimension of J + HG is smaller than that of I + GH; otherwise (8.62) is 
better. 

Since the manipulation of algebraic representations of transfer func- 
tions is very unreliable and therefore best avoided (except possibly in a 
symbolic-computation environment), frequency-response representations are 
inevitable when working with systems which have irrational transfer 
functions~most commonly, systems with time delays. But if state-space 
models are available (or can be obtained from rational transfer-function 
models), then it is much more efficient, and usually more useful, to compute 
the state-space representations of the connected system. We shall concentrate 
on doing this in the rest of this section. 


SERIES CONNECTION 


We suppose we are given two systems, defined by 


X, = Á xı + Biu (8.64) 

yı = Cix; + Diu (8.65) 
and 

X, = ÁX, + Bru, (8.66) 

yz = Cx + Du, (8.67) 


and we connect them in series by setting 

u, = yı (8.68) 
Substituting (8.65) into (8.66) gives 

ž = ÁX + B,C xı + BD, u; (8.69) 


Putting (8.64) together with (8.69) gives 


xy A, 0 Xi B, 
z 8.70 
h Fe pale bora: se 
and substituting (8.65) into (8.67) gives 


ya = [DCi Ca]| %: |+ papau (8.71) 
2 
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Thus a state-space realization of the series connection is 


5 4, 0 B, 
E ral E | [D,C, C2], DaD, ) 


There may be cancellations between the poles of one system and the 
transmission zeros of the other, in which case this realization will not be 
minimal. Often this does not matter, but if it does then a minimal realization 
can be obtained as described in Section 8.3.5. 


PARALLEL CONNECTION 


To effect a parallel connection between the two systems defined by 
(8.64)—(8.67), we set 


u = Us (8.72) 
and we form a new output 
y=yi ty (8.73) 


It is easy to see that a realization of the resulting system is 


rA, 0 B, 
Ii ba EI [C, C], D, + Da ) 


Again, this realization may not be minimal. 


FEEDBACK CONNECTION 


We keep the two models (8.64)—(8.67) and connect them together as shown in 
Figure 8.2 by setting 


u =r- y i (8.74) 
and 


u2 = Yi (8.75) 


Since we again have a series connection of the two systems, (8.70) holds again, 
and the appropriate output equation is now 


pees on) © | + Dim (8.76) 


2 
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Now, from (8.74) and (8.71) we have 


(I + D,D,)u, =r — [D;C, cai * | (8.77) 
2 


so (8.70) becomes 


aJ lae JE] + Lei, Jeta 
X, B,C, Az || x2 B,D, 
x {r= [DC caj (8.78) 
2 


= A, — BY (X\D2)C, — B,(X\C,) xy 
7 B,[I—D,(X\D,)]C; A, — B,D,(X\C2) fa 
ae 8.79 
+ B,D,/X r (8.79) 
where 
X=I1+D,D, (8.80) 


Here B,/X means (XT\ BT)" (and D,/X similarly), and each of the matrices 
(X\D,) and (X\C,) need be computed only once. The output equation 
(8.76) becomes 


x 
yı = LU — D,(X\D,))C, — peac * | +(D,/X)r (8.81) 
2 
Again, the realization given by (8.79) and (8.81) may not be minimal. Note 
that when, as often happens, D, = 0 and D, = 0, the realization simplifies to 


Ay “SBAX\G)) [Byx 
Es A, H 0 | [c10], 0) 


If the matrix I + D, D, is singular, the feedback connection is not 
well-defined, and incorrect modelling of the two systems is indicated. 


CONNECTION OF GENERAL BLOCK-DIAGRAMS 


At first sight, it would seem that not all possible block diagrams can be 
assembled by series, parallel and feedback connections. For example, the 
system shown in Figure 8.3(a) appears to require some other operation. In 
fact, it is possible to assemble all block diagrams by using one feedback and 
two series connections, and regarding all the subsystems as parts of a single 
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(b) 


Figure 8.3 (a) A system defined by an interconnection of subsystems, and (b) a 
standard representation of this system. 


large system, as shown by the dashed-line rectangle in Figure 8.3(b). A 
feedback connection is then made between this large system and a constant 
(non-dynamic) system in the feedback path, which, for the example shown in 
Figure 8.3, is 


000 0 
000 — 
010 0 
IOI 0 
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(and the identity matrices have appropriate dimensions). Series connections 
with constant systems are needed to distribute the input to the appropriate 
subsystems, and to form the output. 


GENERAL REMARK ON INTERCONNECTIONS 


When several state-space models are connected together by repeated use of 
series, parallel and feedback connections, the computed behaviour of the 
resulting system often differs significantly from its true behaviour: the con- 
ditioning of the problem has been destroyed by the successive connections. 
This tends to happen, for example, when a system which has only slow poles 
is connected to one which has only fast poles. In this case a matrix such as 


A, 0 
B, Cy A, 


has a high condition number, and so has 


A, 0 
_ 
ee Re i 


at low frequencies. Frequency responses may therefore be computed with 
significant errors at certain frequencies. 

Sources of and cures for this problem are not yet properly understood, 
but empirical experience shows that the problem is largely avoided if, after 
every connection, a minimal realization is obtained (see Section 8.3.5) and this 
is then balanced (see Section 8.3.9). Connecting together only minimal, 
balanced models appears to be an advisable precaution. However, trans- 
formation to a minimal, balanced realization destroys any physical signifi- 
cance of the state variables, and this is sometimes undesirable. 


8.3.4 Inverse systems 


Given the system 
X= Ax + Bu (8.82) 
y = Cx + Du (8.83) 


with transfer function G(s), we can obtain a state-space realization for the 
inverse system G 7 +(s) if the system is square, and D is non-singular. In this 
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case we write (8.83) as 

u= —D~1Cx+D~'y (8.84) 
which we substitute into (8.82): 

xX =(A—BD~'C)x + BD“ 'y (8.85) 
Since the required inverse system has y as its input and u as its output, 
it is clear that the required realization can be computed as 
([A — B(D\C)], B/D, — D\C, D\I), where again B/D means (D*\ BTY, 
and the ‘\’ operation is defined as in Section 8.2.2. 

If the system has / inputs and m outputs, and / > m and rank (D) = m, 
then a right inverse D? of D exists such that 


DD‘ =I, (8.86) 


It turns out that we can obtain a realization of the right-inverse system 
Gt(s) such that 


G(s)G*(s) = Im (8.87) 
simply by replacing D~* by D? in (8.84) and (8.85). We can prove this as 
follows. Using our earlier results on series connections, we see that the 


realization of the system formed by connecting the output of G'(s) to the 
input of G(s) is 


A-—BD'C 0 BD? [-c CLI 
_ BDtc A > Bpt > +4 1 m (8.88) 
If we call the matrices in this realization A, B, C and D, and apply the state- 


coordinate transformation 


we obtain the realization 


(TAT, 7-18, C7, Jy) = (| 


BpDt es 
| A | [0 C], In ) (8.89) 


A—BD'C 0 
0 Al 
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for the series connection. Applying the usual formula (8.56), we see that this 
has transfer function I,,, so (8.87) is verified. 

The right inverse D* is not unique. One choice for it is the pseudo- 
inverse which can be obtained from the singular-value decomposition of D, as 
in equation (8.52). Alternatively, if we let X = D*C, so that DX = C, we see 
that this is an undetermined linear equation which can be solved using the 
QR factorization as given by equations (8.29), (8.31) and (8.34). Applying these 
equations in this case, we obtain 


[Q, R, P]=QR(D") (8.90) 
ZUR R2] = CTP (8.91) 
X=Q,Z (8.92) 


To compute BD‘ and Dt itself, we can regard these as B(D1I,,) and (D? In), 
respectively, let Y = D'I,,, so that DY = I„, and proceed similarly. If D 
has a high condition number, then the singular-value decomposition is to be 
preferred. If the system has more outputs than inputs, so that | < m and 
rank(D) = l, then a left-inverse system G*(s) can be obtained such that 
Gt(s)G(s) = L, in the same way. 

When working with continuous-time models, the rank conditions on 
D, which are required for these inverse realizations to exist, do not usually 
hold. Indeed, very often D = 0. (The inverses can still be defined by transfer 
functions, but these are now improper and so do not have state-space 
realizations.) However, one often requires a system which shows inverse 
characteristics over some finite frequency range only. In this case, it may be 
possible to modify D in such a way that the original system’s frequency 
response is almost unchanged over the frequency range of interest, but with D 
having the required rank. This should be done in such a way that D is not too 
ill-conditioned, which should be easy if originally D = 0, but may be difficult 
or impossible otherwise. 

Note that inverse systems are rarely of use as compensators since they 
produce large control signals, amplify high-frequency noise and may intro- 
duce unstable pole-zero cancellations. They can be useful inside design 
algorithms, however. 


8.3.5 Minimal realizations 


Given a state-space model (A, B, C, D), we can obtain a minimal realization 
having the same transfer function by removing first any uncontrollable 
modes, and then any unobservable modes. The theoretical basis of the 
algorithm which we shall describe is the Popov—Belevitch-Hautus (PBH) 
rank test for controllability and observability (Kailath, 1980): 
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PBH rank test: The pair (A,B) is controllable if and only if 
[sf — A, B] has full row rank for all (complex) s. 

The pair (C, A) is observable if and only if [sI — A’ — C™]" has full 
column rank for all (complex) s. 


We shall now describe the staircase algorithm (van Dooren, 1981), which can 
be proved to be numerically stable. 
First obtain a singular-value decomposition of B (see Section 8.2.7): 


B=U,,V# (8.93) 


and hence obtain a row compression of B: 


Z, 
UTB -| 6 | (8.94) 


in which Z, has full row rank. Let 


Y, X 
UHAU, =| 1 ~ : 
1 AU, E A, | (8.95) 


in which X, and Y, have the same number of rows as Z,, and A, and Y; are 
square. Now obtain a singular-value decomposition of B,: 


B, = U,=,V# (8.96) 


and, if B, has neither zero nor full row rank, let 
Z 
UB, = | s] (8.97) 
Now observe that 


I 0 n» xţț ol. 
o Uv || B jlo un] 


Y, X,U, 
Ea oo Gi 
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so that the state-coordinate transformation 


I 0 
U 
ie A 


transforms the pair (B, A) into the pair 


Y, X,U, 

LZ; 

ak Z Y, x, (8.100) 
0 Bee Asd | 


This process can be repeated until eventually it terminates with B, = 0, or B, 
having full row rank. If B, = 0 then the pair (B, A) is eventually transformed 
into the ‘staircase’ form 


Zi I.’ s wx la 
Za * x x ! * 
0 Z, a 
; : po: (8.101) 
0 0 0 Ze Rox, 
í 
0 0 0 0014 


in which each asterisk denotes a matrix whose value is not important. 
If we partition this pair into 


then we see that [ B,, sI — A, ] has full row rank for all s (since the Z; have full 
row rank), and [0, sf! — A, ] loses rank whenever s is an eigenvalue of A,. 
This partitioning therefore separates out the controllable and uncontrollable 
‘parts’ of the pair (A, B). 

To be more precise, let 


I 0 
=| 4 4 (8.102) 


in which the dimensions of J are the same as those of Y; _ 4, and let 


| ne A EE 1 (8.103) 


APPLICATIONS TO LINEAR SYSTEMS 381 


E 


Then 


D —C, * 
B, sI-A | * 
cat OST nee, (8.104) 
I 


so that (A,, B,, C,, D) is a controllable realization with the same transfer 
function as (A, B, C, D). 

Note, in passing, that the row dimensions of the Z, are the con- 
trollability (Kronecker) indices of the pair (A, B), which play an important 
role in multivariable-system identification. Also, the eigenvalues of A, are the 
so-called input decoupling zeros. 

The realization (A,, Be, Co, D) may not be minimal, since the pair 
(C, A) may be unobservable. Since (C, A) is observable if and only if (47, C7) 
is controllable, the algorithm described above can be used to remove the 
unobservable modes. By this means (and by employing conjugate transposes 
in (8.102) as appropriate) the pair [C7 AT] is transformed into the pair 


[C.o 9] 
A. 0 (8.105) 
* A, 
; „ID =C. |. ` 
and the matrix ‘pencil’ is transformed into 
B, sI — 4. 
D -Co | 0 
i 
Boo sl— A.o ! 0 
caer Mein (8.106) 
! 
1 
* * | sl — A, 


so that (A.o, Boo, Ceo, D} is a minimal realization with the same transfer 
function as (A, B, C, D). 

When we compute the minimal realization, we have to decide whether 
certain quantities are zero or not. The rank of each of the matrices 
B, B,,..., B, in equations (8.94) and (8.96) must be determined. This is done 
by examining the singular values in 2,, 23, ... and deciding how many of 
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them are zero, either in an ‘automatic’ way, by comparing their magnitudes 
with the machine precision, or by comparison with a user-defined threshold 
value. 


8.3.6 Partial-fraction decomposition 


We shall now describe how a state-space system can be decomposed into two 
(or more) parallel systems, the eigenvalues (poles) of the original system being 
divided between the two systems, and no new eigenvalues being introduced. 
The most common application of this is to the decomposition of a system into 
its stable and unstable parts. If the original system has state dimension n, and 
all its eigenvalues are distinct (and sufficiently distinct to be numerically 
distinguishable), then it can be decomposed into n parallel first-order systems, 
each corresponding to one of the eigenvalues. Such a decomposition corre- 
sponds to a partial-fraction decomposition of the transfer function. (In this 
case some of the matrices A may be complex, and it may be more useful to 
combine complex-conjugate pairs of eigenvalues into real second-order 
systems.) However, if some of the eigenvalues are repeated then we cannot 
reliably split up the repeated eigenvalues. In this case it is more accurate to 
speak of a ‘direct-sum decomposition’. 

In Section 8.3.3 we saw that one realization of a parallel connection of 
state-space systems has a block-diagonal matrix A. So the problem of 
decomposing a system into a parallel connection of subsystems is equivalent 
to the problem of finding a state-coordinate transformation which results in 
the matrix A being block-diagonal, each of the diagonal blocks having the 
required eigenvalues. This problem can be solved in three steps: first reduce 
the matrix A to Schur form (usually to real Schur form), secondly reorder the 
Schur form so that the eigenvalues (diagonal elements) are grouped as 
required, and thirdly reduce the remaining off-diagonal blocks to zero 
without disturbing the diagonal blocks. 

To describe this procedure in detail, we assume that the decomposi- 
tion is to be into two (real) subsystems. Each resulting subsystem can then be 
decomposed further, if required. First, find a unitary similarity transforma- 
tion Qo which reduces A to real Schur form (see Section 8.2.5): 


An A 
migs ae i 8.107 


where A,, and A,, are upper quasi-triangular, and their dimensions are 
determined by the required state dimensions of the parallel subsystems — for 
example, if A has five eigenvalues, of which two are stable, and a stable/ 
unstable decomposition is required, then A,, has dimension 2 and A,, has 
dimension 3. After this transformation the eigenvalues are distributed ran- 
domly between A,, and A,,, so we now have to find a state-coordinate 
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transformation which distributes them as required (to continue the example, 
the two stable eigenvalues in A,,, the remaining three in A,,). 

We proceed by finding a sequence of state-coordinate transforma- 
tions, each of which interchanges a pair of adjacent diagonal elements of 
QÏ AQo, by means of so-called Givens rotations. A Givens rotation J (i, j, 0) is 
an orthogonal matrix which is almost an identity, but with the elements (i, i), 
(i,j), (j, i) and (j, j) replaced by the elements of the rotation matrix 


ee 


where c=:cos @ and s = sin 8: 


1 
l Cy S < rowi 
J (i,j, 0) = i (8.108) 
-s l Cy <— rowj 
ial 
T 


columni column j 


Suppose that the 2 x 2 block at the intersection of rows and columns k and 
k + 1 of QU AQ, has the form 


ie tek +4 | 
O eerie 


Find @ such that 


cC S Ék k+1 * 
: = 8.109 
E | Pees ee H ( ) 


(where the value of * is unimportant). Then 
Tu T 
J(k,k + 1, 0)QGAQod(k, k +1,0) 


(which remains similar to A) will remain quasi-triangular, it will differ from 
Q§ AQ, only in the kth and (k + 1)th rows and columns, and the 2 x 2 block 
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at the intersection of these rows and columns will have the form 


te+iet1 lent 


If Q4 AQ, is triangular —namely, if all the eigenvalues of A are real—then the 
following algorithm will give the required transformation, if A denotes 
the set of eigenvalues which is to appear in the top left-hand block, and 


T = Q§AQo: 


p:= dim (A41); 
Q= Qo; 
while {t,;, f9,--- +» top} #A do 


begin for k:=1 to n—1do 
if tkf A and t41, 44 46A then 
begin 
find 0 as above; 
Q:= QJ(k, k + 1, @)7; 
T A 
T:=J(k, k+ 1,0) TJ(k, k + 1, 0)* 
end. 


end. 


When this algorithm terminates, we have T = Q" AQ, with T upper-triangu- 
lar, the top left-hand p x p block having A as its set of eigenvalues. 

If A has complex eigenvalues, so that T is quasi-triangular but not 
triangular, then the algorithm becomes more complicated and rather dif- 
ferent in nature. In this case, if we wish to rearrange the eigenvalues of the 
blocks / 


| Ty Dari | 

0O Teer 

where one or both of Tj, T;+1,;+1 are 2 x 2 matrices, one step of the QR 
algorithm (see Section 8.2.5) is performed, with the shift chosen to reverse the 
ordering of the eigenvalues — here we can choose the shift to accomplish this 
reversal, since we already know the eigenvalues of T. The algorithm gen- 
erally used for this, and for performing Givens rotations on 1 x 1 blocks, is 
known as EXCHNG, and is given by Stewart (1976). 
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In practice it is never necessary to isolate single complex eigenvalues, 
so it is never necessary to work with the complex Schur form. But in principle 
it is possible to do so; in this case the Givens rotations have to be generalized 
to unitary matrices of the form 


c S5 
—s c 
in which c is real, and c? + ss = 1. 
We have now obtained 


(8.110) 


o"ao=T=| 7 | 


and must find a further state-coordinate transformation to reduce 7,, to 
zero. If T;, and T,;, have no common eigenvalues then this can always be 
done, using a transformation 


JET? fg 
r=| 4 d (8.111) 


in which the dimensions of the blocks are the same as those in T (Golub and 
van Loan, 1983). Suppose that 


z T, 0 
Y-'Ty=T=| ” : 
| 0 a a 


then 
TY = YT (8.113) 


leads to the Sylvester equation 


Ti, Yi. = Y2 Taz = = £132 (8.114) 


which must be solved for Y,,. 

Since T,, and T,, are quasi-triangular, or even triangular, this equa- 
tion can be solved efficiently by a kind of back-substitution process. The best 
algorithm for doing this is the one proposed by Bartels and Stewart (1972), 
and described by Golub and van Loan (1983). 

Once equation (8.114) has been solved, the state-coordinate trans- 
formation required to decompose the original system is 


R=QY (8.115) 
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so that the decomposition is obtained by simply partitioning the matrices of 
the realization (R~ + AR, RT ‘+B, CR, D). The matrix D is not partitioned, of 
course, but may be regarded as a separate parallel subsystem. Note that, 
although Y is not orthogonal, Y~ ' can be computed stably since 


ry, 
= ee 12 
Y alo ; | (8.116) 


The process of rearranging eigenvalues in the Schur form can lead to 
significant changes in the values of any ill-conditioned eigenvalues. It is 
therefore not advisable to try to obtain a decomposition into subsystems 
which have nearly equal eigenvalues. This can create problems when one is 
trying to separate out the stable and unstable parts of a system, if there are 
eigenvalues which are close to each other but lie on opposite sides of the 
stability boundary. 


8.3.7 Transmission zeros 


We now consider the computation of transmission zeros from the state-space 
realization (A, B, C, D) of a linear system. If D is invertible, these zeros are 
just the eigenvalues of A — BD ~' C, but any attempt to use this fact directly 
for computation leads to a numerically unstable algorithm. It is also of 
limited use, since D is frequently singular. If D is singular then there are fewer 
than n finite transmission zeros (where n is the dimension of A), and zeros 
appear ‘at infinity’. Naive attempts to compute the zeros usually end up 
transforming these infinite zeros to arbitrary locations in the complex plane, 
where their presence may destroy the conditioning of the true (finite) zeros. 
The matrix 


P(s) = oe p (8.117) 


which appeared in equation (8.104) (with its rows and columns rearranged) is 
known as Rosenbrock’s system matrix (Rosenbrock, 1970). It loses rank at 
those points in the complex plane which are either eigenvalues of A corre- 
sponding to uncontrollable or unobservable modes (input or output 
decoupling zeros), or transmission zeros of G(s) = C(sI — A)" 'B + D. If 
(A, B, C, D) is a minimal realization, then the system matrix loses rank only 
at the transmission zeros, so we shall assume that we start with a minimal 
realization (obtained as in Section 8.3.5). 

A numerically stable algorithm for finding transmission zeros is 
obtained by first performing row and column compressions on (parts of) the 
system matrix, in order to remove any zeros at infinity, and then finding the 
zeros by solving a generalized eigenvalue problem. 
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We begin by obtaining a singular-value decomposition of D: 


and using it to compress the rows of D, simultaneously transforming C: 


UTC, D]= la a (8.119) 


If D = 0 we set Cy) = C. 2 
Next we compress the columns of Co, by first obtaining another 
singular-value decomposition: 


Co = PA 07 (8.120) 
C,0,=[0 S,] (8.121) 


(where we have assumed that the singular values in A, have been reordered 
to place the zero singular values in the top left-hand corner, in order to 
compress the columns of Cy to the right rather than the left). 

Let c, be the number of rows in Cy and Do, and p be the number of 
columns of D (that is, the number of system inputs). Now define 


| 0 | 
Sef acid ee (8.122) 


where the blocks denoted by * have the same number of columns as S4; A, is 
square, with the same number of columns as the zero block in (8.121); and B, 
and D, have p columns. 

This whole process is now repeated, starting with (A,, B,, C,, Di) 
where k = 1 initially, to generate the sequence C,, C,,D,, Skis Ans, 
BysisCy+1,Dy41(k = 1,2,...), and is terminated either when D, has full 
row rank, or when C, has either zero or full column rank. If at any stage C, 
has full column rank, then the system has no (finite) transmission zeros, and 
there is no more to be done. 

When either D, has full row rank, or Č, = 0, we set A, := A,, B,:= B,, 
C,:= C, and D,:= D,, and proceed to the dual process described below. Note 
that D, has full row rank. 
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In the jth step of the above process we effectively perform the 
following transformation, if we adopt the convention that (Ay, By, Co, Do) 
= (A, B, C, D): 


k e B,- IQ; 0 
0 Ujit Dral 0 G 
(8.123) 


so that on termination we have transformed the original system matrix to 


slI—A, * * T * B, 
=C; * * weit * D, 
0 a Sy * eae x 0 
: . (8.124) 
0 =S- 
r: * 
0 . 
0 -S, 


This matrix loses rank at the same values of s as the original system 
matrix, since the transformations employed at each step (8.123) are non- 
singular. But each S, has full column rank, so (8.124) loses rank at the same 
points at which 


A es (8.125) 
=. C, D, 


loses rank. In other words, the (finite) transmission zeros of (4, B, C, D) are 
the same as those of (A,, B,, C,, D,). 

We now submit (A,, B,, C,, D,) to the dual process of that described 
above, in which the roles of rows and columns are systematically inter- 
changed. (Equivalently, we could submit (A7, C7, B7, D7) to the same 
process as that above, and apply a corresponding transformation to the 
result.) This will either show that there are no transmission zeros, or termin- 
ate yielding (A,,, Bre; Cres Dye) in which D,, has full column rank, by con- 
struction. But the full row rank of D, is not destroyed by the dual algorithm, 
so D,, must be square and invertible. Hence (A,,, Bre» Cros D,,) has finite 
transmission zeros only, and these are the same as the transmission zeros of 


(A, B, C, D). 
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It remains to find these transmission zeros. For this we need yet 
another singular-value decomposition, this time of [ BY, DI J": 


| By. | =XEY (8.126) 
De 


and we use this to transform the system matrix of (A,,, Bres Cros Drc X 


aaa a 5 | (8.127) 
— Ce De * D 

(where we have again used a reversed ordering of the singular values in &, 
with respect to the usual ordering). Now, this transformed system matrix 
clearly loses rank only when sB— A loses rank-—in other words, at the 
generalized eigenvalues of the pair (A, B) (see Section 8.2.8). That is, the 
generalized eigenvalues of the pair (A,B) are the transmission zeros of 
the original system (A, B, C, D). 

Computation of the eigenvalues of A,,—B,,D;,’C,. has been 
avoided, since D,, may be ill-conditioned with respect to inversion. 

The algorithm described in this section is taken from van Dooren 
(1981a). A faster version, which uses Householder transformations instead of 
singular-value decompositions, is described by Emami-Naeini and van 
Dooren (1982), who give an extensive discussion of the performance of the 
algorithm and compare it with other proposed methods. 

In the course of this algorithm the rank of certain matrices must be 
determined at each step. As discussed in Section 8.2.7, these decisions are 
usually taken by comparing singular values with the machine precision. 
However, there may be grounds for considering larger singular values to be 
zero, for example if one’s data (namely the matrices A, B, C and D) is known 
to be approximate, and information about its precision is available. 


8.3.8 Root loci 


It is occasionally useful to follow the variations of the closed-loop character- 
istic roots when negative feedback of the form kI is placed around a square, 
multivariable system, as shown in Figure 8.4. Here k is a scalar parameter 
which is usually real and takes values in the range 0 < k< œ, or (less 
commonly) —coo<k<oo. From (8.79) we find that the closed-loop 
‘A’ matrix in this case is 


A, = A—B(I+kD)~*kC (8.128) 


if (A, B, C, D) is a realization of the system in the forward path. If we let 
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Figure 8.4 Multivariable root loci are usually investigated for this feedback 
configuration, as the gain k is varied. 


g = 1/k, we obtain 
A, = A— B(gI+ DY 'C (8.129) 


and the closed-loop characteristic roots are simply the eigenvalues of A.. 
Note the identity of form between the two expressions 


A.(g) =A — B(gI + D) +C (8.130) 


and 
G(s)=D + C(sI— A) 'B (8.131) 


It implies that, if we have an algorithm for finding the characteristic loci of a 
system with realization (A, B, C, D), then we can find the multivariable root 
loci by applying the same algorithm to the realization (— D, C, — B, A). (If 
we choose a positive-feedback convention we do not even need to change the 
signs of B and D.) 

We have not assumed (A, B, C, D) to be minimal. Any uncontrollable 
or unobservable modes simply correspond to eigenvalues of A. (g) which do 
not vary with g. This can be used to give a simple visual test for minimality, 
when appropriate display facilities are available. 

For a theoretical investigation of multivariable root loci see 
Postlethwaite and MacFarlane (1979) and the references cited therein. 


8.3.9 Balanced realizations and model approximation 


For a continuous-time, asymptotically stable system, a balanced realization 
(A,, By, Cy, Da) is one for which the Lyapunov equations 


A,P + PAT + B,BE =0 (8.132) 


and 
ATQ+QA,+ CIC, =0 (8.133) 
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have a common solution 
P=Q0=} (8.134) 


and & is diagonal. Such a realization always exists. 

Each state variable is, in a sense, equally strongly coupled to the input 
and the output, in a balanced realization, and this lies at the root of the 
usefulness of these realizations, particularly for model approximation. 

We assume that we are given a minimal realization ( A, B, C, D}, which 
is not balanced. Exercise 6.12 has already given the basis of an algorithm, 
which could be implemented in four steps: 


(1) Solve the two Lyapunov equations, giving solutions P and Q. 


(2) Obtain the factorization Q = RTR; this can be done using Cholesky 
decomposition (Golub and van Loan, 1983). 


(3) Find the singular-value decomposition 

RPR™ =U2U7 (8.135) 
(4) Form the balanced realization 

(A,, By, Cy, Dy) = (TAT +, TB,CT~',D) (8.136) 


where 


ra "PUTR (8.137) 


This algorithm can be improved as follows. In step 1 use the algorithm 
proposed by Hammarling (1982), which never forms the Lyapunov solutions 
P and Q, but only their ‘square roots’, R and S respectively, such that 


P=S™S and Q=R'™R (8.138) 


Hammarling’s algorithm is based on finding the real Schur form of A, then 
using a back-substitution process to solve for R or S, without ever forming 
the products C'C or BB’. The resulting factor is triangular. Use of this 
algorithm clearly avoids the need for step 2, and one can go directly to step 3. 
Here again there is room for improvement, because one now needs the 
singular-value decomposition of the product 


RS’=UxXVT (8.139) 


which is best found by using the algorithm developed by Heath et al. (1986), 
without actually forming the product RST. Finally we set T = £7 1/2 UTR, as 
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before, but we can now obtain T~/ as 
Tas ye (8.140) 


This impreved algorithm was put forward by Laub et al. (1987). 


MODEL APPROXIMATION 


One of the principal uses of balanced realizations is in the approximation of 
state-space systems by simpler ones which approximate the input/output 
behaviour of the original systems. The need for such approximations arises 
constantly when working with multivariable systems. As mentioned at the 
end of Section 6.6, Hankel approximation theory can be used to obtain 
approximations which are optimal in the sense that they minimize the 
Hankel norm of the approximation error, and of course balanced realizations 
play a crucial role in obtaining these approximations. 

But in practice very good approximations can be obtained by using a 
rather simpler procedure (Moore, 1981) in which a balanced realization of the 
original system is obtained, and then simply ‘truncated’ by discarding those 
parts relating to the state variables which are most weakly coupled to the 
inputs and outputs. Suppose that we have a minimal balanced realization of 
an asymptotically stable system G, of McMillan degree N. Let the common 
solution of (8.132) and (8.133) be 


= 0 
y= 
es 


Ly = diag {o,, 02; rrr) a, 


where 


and £, £, have no common eigenvalues. Now let the matrices of this 
realization be partitioned conformally with È: 


A A B 
o:(| = g | a [C Cı], p) 
Ay, A22 B, 
and let G, be the system with realization 


Gy: (A11, Bi, C,,D) 


Then it can be shown that G, is asymptotically stable, (A,,,B,,C,, D) is 
a minimal and balanced realization of G,, and 


a N-k 
|G — Gy llo <2) Ok+i 
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8.3.10 Algebraic Riccati equations 
We have already seen, in Chapter 5, that the continuous-time algebraic 
Riccati equation 


ATP + PA — PBR™'B™P +0 =0 (8.141) 


in which Q20 and R>0O, can be solved by performing an 
eigenvalue—eigenvector analysis of the Hamiltonian matrix 


—-1 pT 
n= drs l (8.142) 


This method is not to be recommended, however, because of the difficulties 
associated with computing eigenvectors. A better algorithm is obtained by 
recognizing that it is sufficient to find a basis for the subspace spanned by the 
eigenvectors which correspond to the stable eigenvalues of H (Laub, 1979). 
Such a subspace can be determined reliably by reducing H to real Schur form, 
and then reordering the eigenvalues in this form. 

In the notation used in Section 8.2.5, we write 


[X, S] = Schur(H) (8.143) 


We then reorder the diagonal elements (and 2 x 2 blocks) of S, using the 
algorithm described in Section 8.3.6, to get the partitioning 


zīşz=| 5u * (8.144) 
0 Sy 


in which all the eigenvalues of S,, lie in the left half-plane, and all those of S23 
in the right half-plane. (Eigenvalues on the imaginary axis are precluded by 
the standard assumption that (A, B, Q'/*) is minimal.) Recall that there are 
equal numbers of eigenvalues in each half-plane. Now let T= XZ, and 
partition T into 


T, 1 | 
T= 8.145 
l Ta T22 iia 
so that 
| Ti | | T; 1 | 
H = S 8.146 
Then the unique positive-semidefinite solution of (8.141) is given by 
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A weakness of this algorithm is that inversion of R is required in 
order to form the Hamiltonian H. If R is ill-conditioned this can introduce 
large errors. To overcome this, van Dooren (1981b) has shown that P can also 
be computed from the generalized Schur decomposition (see Section 8.2.8) of 
the pair (I, H), where 


A 0 B 
=| -0 —A™ 0 (8.148) 
0 BT R 


This avoids the need to invert R. 


8.4 Software for control engineering 


8.4.1 Mathematical software 


A good numerical algorithm is not the same thing as good mathematical 
software. A good implementation is necessary to transform one into the 
other. Attention must be paid to many small details to guard against 
unexpected conditions which may lead to overflows or underflows, to ensure 
that the sequence of computations is ordered so that it does not increase the 
sensitivity of the calculation unnecessarily, that the software will produce 
similar results when processed by various compilers on various machines, 
and to avoid unnecessarily lengthy computation. Even careful attention to all 
these details is of little use without good documentation, and without a good, 
well-defined user interface. Obviously, such documentation should specify 
the data required by the software and the results produced by it. But it should 
also define the class of problems for which the software produces reliable 
results, what happens if the boundary of this class is approached, and how 
failures and difficulties are signalled to the user. 

Many of the ‘building-block’ algorithms which are needed for control- 
engineering software are already available in the form of high-quality 
mathematical software, and have been assembled into coherent subroutine 
- libraries. The best-known of these are the Eispack and Linpack libraries 
for eigenproblems and linear systems of equations, respectively, and the NAG 
(Numerical Algorithms Group Ltd) and IMSL (International Mathematical 
and Statistical Libraries, Inc.) libraries, each of which covers the whole field of 
numerical analysis. 

Similar libraries of routines, but aimed specifically at the needs of 
control engineers, have recently appeared. Examples of these are Slice, which 
was developed in the United Kingdom and is distributed by NAG, and 
Oracls, which was developed in the USA by NASA. Details of these and other 
libraries, and also of other software relevant to control engineering, can be 
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found in the Extended List of Control Software (ELCS), which is a collection 
of one-page summaries of subroutine libraries and software packages, and 
which is revised regularly (Frederick et al., 1987).! The Benelux Working 
Group on Software has also compiled an inventory of software for control 
(WGS, 1985). 

Some tools are available to help the development of good mathemat- 
ical software, such as the PFORT verifier which detects the presence of non- 
standard FORTRAN code in a program. Implementation is also made easier 
by the existence of IEEE Standard 754 for binary floating-point arithmetic 
(IEEE, 1981), and this is being adhered to by increasingly many processors. 
But this still leaves the development of good mathematical software as a very 
expensive activity which should not be undertaken lightly. Of course, this is 
not an argument for the development of low-quality software; that inevitably 
turns out to be even more expensive in the long run. Rather, it implies that 
low-level software development should be undertaken only as a last 
resort -the software you need is probably already available somewhere. For 
most projects, it should not even be necessary to write high-level software 
nowadays, except perhaps macros in some control-specific language. 


8.4.2 Software packages 


Of course, a control engineer (and any other user of software) does not want 
to work at the level of subroutine libraries—he needs a complete working 
program, or set of programs. The nature of control-system analysis and 
design, as of any engineering design process which contains unpredictable 
sequences of cut-and-try activities and relies heavily on graphical indicators 
of performance (time and frequency responses, mostly), dictates that such a 
program should be interactive, and should have graphical output facilities. 

A number of software ‘packages’ now exist which integrate basic 
algorithms with data management, an interactive user interface and graphical 
output capability, and which are aimed specifically at control engineering. 
Such packages were first developed in Europe in the 1970s by university 
research groups. An early example of a very successful family of such 
packages, which still compares well with some more recent offerings, is the 
Intrac-based software developed at the Lund Institute of T. echnology in 
Sweden (Åström and Wieslander, 1981). 

Of more direct interest to the topics covered in this book are the 
software packages which were developed in the UK to support research into 
multivariable frequency-domain techniques. One of these, the Cambridge 
Linear Analysis and Design Programs (CLADP), has now become a properly 
supported commercial product and offers a comprehensive software facility 
for multivariable feedback analysis and design (Edmunds, 1979; Maciejowski 
and MacFarlane, 1982). It provides not only basic facilities such as com- 
putation and display of Nyquist arrays, Gershgorin bands, characteristic loci 
and so on, but also facilities for manipulating polynomials and matrices 
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(which makes it possible to perform LQG design, for example) and useful 
tools for performing tasks such as converting from transfer-function to state- 
space models, or from continuous time to discrete time, or converting block 
diagrams into equivalent single systems. l 

In about 1980 a new family of software packages began to appear in 
the USA. The Eispack and Linpack subroutine libraries had been provided 
with an interactive ‘front end’, named Matlab (Matrix Laboratory), which 
made it possible to enter command lines such as 


V= EIG(A* 8B’) 


at a computer terminal, and obtain the eigenvalues of the matrix product 
ABT almost immediately, stored in the vector variable V. Matlab also 
allowed ‘macros’ to be written, namely collections of Matlab statements 
which could be saved on file and executed at will. Since these statements 
included flow control statements such as 


“f... then... else... 


this effectively constituted a high-level programming language specifically 
tailored to numerical linear algebra. This language was used as the basis of 
the control-oriented packages, CTRL-C (Little et al., 1985) and Matrixx 
(Shah et al., 1985). Basically, these packages added control-specific macros to 
Matlab, added good graphical output facilities, and provided interfaces to 
model-building and simulation software. With one of these packages avail- 
able, the user was able to enter statements such as 


K = OPTREG(A, B, Q, R) 


to obtain the state-feedback matrix which implements a linear—quadratic 
optimal regulator, for example. The resulting time and frequency responses 
could then be displayed easily. 

This approach to producing control software has proved very popu- 
lar. Its major attraction is that the user is faced with a very uniform interface, 
he can easily mix high-level control-specific commands with low-level nu- 
merical or graphical commands, and he can easily tailor the package to his 
own needs by writing appropriate macros. Whereas the original Matlab was 
freely available in the public domain, a new commercial version is now 
available (for personal computers, called PC-Matlab, and for minicomputers 
and workstations, called Pro-Matlab). In this version, great attention has 
been paid to making it possible for the user to tailor the software for his own 
specific needs, and to extend its capabilities. For example, it is very easy to 
write one’s own macros, with formal input and output parameters, some of 
which may be omitted and replaced by default values. Also, this version has 
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simple but powerful graphics facilities (plot (c) will plot the vector c of 
complex numbers on the complex plane, for example), and the code has been 
optimized so that the speed of execution is impressively high, even on a 
personal computer. Collections of macros for use with this new version are 
called toolboxes, and at the time of writing a Control System Toolbox (Little 
and Laub, 1985), a System Identification Toolbox, a Multivariable Frequency 
Domain Toolbox (Boyle et al., 1989) and a Robust Control Toolbox are 
available. 

An inherent limitation of Matlab is that the only data type which it 
supports is the (complex) matrix. Although this is enough for most of state- 
space based linear systems theory, there are clearly objects which arise in 
control engineering which cannot be naturally represented as matrices. 
Obvious examples are multivariable time and frequency responses, which are 
more naturally represented as three-dimensional arrays. It is well known that 
the development of software is greatly eased if powerful data-structuring 
facilities are available, and such facilities form the central feature of modern 
programming languages such as Pascal or Ada. An interesting new package, 
which uses the Matlab style of interaction with the user, but which is written 
in Ada and offers the user an Ada-like interactive language, is Impact 
(Rimvall, 1987). This supports control-specific data types such as transfer- 
function matrices, linear-system descriptions and trajectories. However, this 
package is still under development, and for the time being must be viewed as 
a promising experiment in computer-aided design. Another package which 
aims to provide Matlab-like features with the addition of control-specific 
data structures is Basile (Delebecque et al., 1988). 

All the packages mentioned in this section, and many others, are listed 
in the ELCS, mentioned in Section 8.4.1. The ELCS includes information on 
the availability of packages, their functionality, hardware requirements and 
contact addresses. In this section we have discussed only those software 
packages which are specifically designed to support design techniques of the 
kind described in this book. There are, of course, many packages aimed at 
other control-engineering activities, the principal one being simulation. For 
information about these the reader is again referred to the ELCS. 

Other sources of information about the state of computer-aided 
control-system design up to 1988 are: Herget and Laub (1982), Herget and 
Laub (1984), Jamshidi and Herget (1985), Hansen and Larsen (1985), IEEE 
(1986) and Chen (1988). 


8.4.3 Current trends 


Most of the activity in software development for control-system design which 
is currently going on, is concerned with taking full advantage of the rapid 
changes which are occurring in the computing environment, such as the 
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advent of very powerful personal workstations. This activity is concentrated 
in three areas: integration of software packages into unified control-engineer- 
ing ‘environments’; increased use of high-resolution graphics, particularly for 
data input and problem specification; and the use of techniques from artificial 
intelligence. There is also some activity which is driven by theoretical 
advances—software packages for H,, design and adaptive control are be- 
ginning to appear, for example, and research is being pursued into the use of 
parallel computing for handling models with very large numbers of states, 
such as those obtained from finite-element software. 

The interest in integration arises from the difficulties which exist at 
present with following a project through the stages of modelling and system 
identification, analysis and design, simulation and implementation. Software 
is available to support each of these stages, but there is frequently difficulty in 
transferring data between the various software packages in use. In a large 
project there is also the difficulty of keeping track of the data generated 
during the lifetime of a project. Several of the commercially available pack- 
ages offer interfaces to other packages, two-way transfer of data between 
design and simulation software being quite common. 

Within a few years we are likely to see control-oriented software 
environments which offer sophisticated data-management facilities, uniform 
interfaces to constituent packages, and considerable freedom to tailor such an 
environment to one’s own needs by importing packages of one’s own choice. 
To facilitate such a development, the International Federation of Automatic 
Control (IFAC) has recently begun the process of defining standards for data 
structures, user interfaces and numerical algorithms. Some engineering cor- 
porations have already made progress in this direction (Spang, 1984), and in 
the UK an integrated environment of this kind, called Ecstasy, has been 
developed (Munro, 1988). 

Several proposals have been made to use techniques developed for 
expert systems to help in the design of control systems (James et al., 1985; 
Birdwell et al., 1985; Boyle et al., 1989; Pang and MacFarlane, 1987). These 
proposals recognize that control design is a complex activity, and that 
mathematical design of the kind described in this book is performed relatively 
rarely, even by practising control engineers — most of their efforts are devoted 
to modelling and implementation. There is therefore much scope for using an 
expert system to guide a designer through the design process, for example in 
the following sequence: 


(1) Examine plant characteristics, stability, and so on. 

(2) Define specification. 

(3) On the basis of 1 and 2, choose a suitable design technique -start with 
the simplest. 


(4) Attempt a design; if unsuccessful, return to 3 and try another 
technique. 
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(5) If iterations through 3 and 4 do not terminate, return to 2 and 
consider modifying the specification to remove inconsistencies. 


(6) If successful, move on to validation by means of simulation. 


At each of these steps the expert system may provide detailed expertise on 
how to use a particular software package, how to choose parameters for 
various algorithms, and so on. 

It is sometimes claimed that expert systems have the potential to solve 
difficult control problems, such as the problem of loop assignment in process 
control systems. The reader is advised to treat such claims with scepticism. 
Current expert systems have very little ability to learn, in the sense of being 
able to infer rules from observed behaviour; they can do little other than 
encapsulate knowledge which is already available. And at present we do not 
know how to solve ‘difficult’ control problems — that is why they are difficult. 
The solution of such problems must await suitable theoretical advances. 

The field of computer-aided control-system design is developing at 
such a rate that speculation about the future is of little value. To keep up with 
developments in this field there is no alternative to looking regularly at the 
relevant journals, particularly the IEEE’s Control Systems Magazine (includ- 
ing the advertisements), occasional publications such as the Benelux WGS 
Newsletter, and the proceedings of relevant conferences. The two regular 
international events in this field are the IFAC and IEEE Symposia on 
Computer-Aided Control System Design (CACSD), which are held at inter- 
vals of 3 years and about 18 months, respectively. 


SUMMARY 

Much of this chapter has been devoted to numerical analysis. The notions of 
the conditioning of a problem and the numerical stability of a solution 
algorithm have been introduced, and some fundamental algorithms from 
numerical linear algebra have been presented. These have then been 
applied to solving some problems which occur when analysing and de- 
signing feedback systems. The algorithms which have been presented range 
from very straightforward ones, for performing tasks such as representing 
interconnections of systems, to very sophisticated ones, such as the al- 
gorithm for finding transmission zeros. _ 

These algorithms are nowadays available in software products, usu- 
ally suitable for interactive use, so that the control engineer no longer needs 
to work with them as subroutines. But experience has shown that some 
knowledge of how these algorithms work is needed, since it is often 
necessary to be aware of their limitations. Some of the available software 
products have been described, and current trends in computer-aided con- 
trol system design have been reviewed. 
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For more information on ELCS, contact Professor D. K. Frederick, 
Rensselaer Polytechnic Institute, Troy, NY 12180, USA; Dr C. J. Herget, 
Lawrence Livermore National Laboratory, Livermore, CA 94550, USA; or 
R. Kool, Eindhoven University of Technology, NL-6600 MB Eindhoven, The 
Netherlands. 


EXERCISES 
a 


8.1 


8.4 


Let p(x) denote the exact solution of a problem p when given the data 
x, and let m(x) denote the solution when computed by a particular 
algorithm. 

Devise formal definitions of conditioning of the problem and 
numerical stability of the algorithm, and show from your definitions 
that, if p is well-conditioned and m is numerically stable, then the 
computed solution is close to the true solution. Assume that a suitable 
norm can be defined on the set of possible solutions. 


A state-space model has two inputs, three outputs, and n states. Its 
frequency response is to be evaluated at 50 frequencies, using Laub’s 
algorithm. How large does n have to be before the overhead of 
transforming to Hessenberg form becomes significant? If your 
machine can perform 10° complex floating-point operations per 
second, how long does the evaluation take with this number of 
states? How long would it take if the transformation to Hessenberg 
form were omitted? 


Evaluate and plot the frequency response and characteristic loci of the 
AIRC model, which is defined in the Appendix, at 50 frequencies 
between 0.01 and 100 rads~‘. Try this with both linear and logar- 
ithmic spacing of frequencies. 

For the three-input, two-output system defined by 


X¥=—-x+[11 1]u 


a4 T 1 01 0 
raiaj To w AO 
find an approximate right-inverse system which gives a good approxi- 
mation (in frequency response) up to 
(a) w = 10 
(b) w=10+ 


8.5 


8.6 


8.7 
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What is the condition number of the modified matrix D in each case? 
Check the frequency and step responses of the series con- 
nection of the inverse system with the original systern in each case. 


Describe in detail how van Dooren’s ‘staircase’ algorithm is used to 
remove the unobservable modes of a state-space model. 


Implement an algorithm for computing transmission zeros, given a 
state-space realization. (To do this without an inordinate amount of 
effort, you should use a matrix-oriented language such as Matlab.) 

Use your algorithm to find the zeros of the system 
(Kouvaritakis and MacFarlane, 1976) 


Spgs e 26> | Ee. 
ee ee a -8 —5 
A=} 0 2 0 2-2], B=| -3 0 
0 6-3 5 —6 1 5 
Orso. Bea %s -8 0 


0 -1 2 —-1 -1l 
C=| 1 1 1 0 -l 
0 3. —2 3 —ł 


, D= 


Note that this system is not square, so arbitrary perturbations of its 
elements will destroy any finite zeros. However, correct implemen- 
tation of the required numerical rank decisions will result in (almost) 
the correct solution being found. 


(Solution: —3, +4) 
Verify that, if 


| A Eo A a E e A 
—Q —At T >, T 2 Tai Ty. 0 S22 
and T,, is invertible, then P = T,,7;;' is a solution, of the algebraic 
Riccati equation 


ATP + PA — PBR~'B™P+Q=0 


If K = R`' BTP, verify that the eigenvalues of A — BK are the 
same as those of S,,. (See Section 5.2 for the significance of this.) 
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8.8 Use the model approximation method given in Section 8.3.9 to find a 
controller of lower order than the 17-state H,, controller found in 
Section 6.8. 

(Note: This controller has a mode at zero which is nearly unobserv- 
able; on some machines the realization will be effectively non-minimal, 
and the algorithm given in Section 8.3.5 may have to be used to 
remove this mode before a balanced realization can be obtained.) 
You should find that a 12-state approximation gives virtually the 
same performance as the original one, but that the performance begins 
to deteriorate as the number of states is reduced below 12. 
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APPENDIX 


Models used in 
Examples and 
Exercises 


A.1 Aircraft model AIRC 


This model has been taken from Appendix F of Hung and MacFarlane 
(1982). It represents a linearized model of the vertical-plane dynamics of an 
aircraft, and has three inputs, three outputs, and five states. 

The inputs are: 

ul: spoiler angle (measured in tenths of a degree) 


u2: forward acceleration (ms~ 7”) 


u3: elevator angle (degrees) 
The states are: 


x1: altitude relative to some datum (m) 
x2: forward speed (ms~') 

x3: pitch angle (degrees) 

x4: pitch rate (degs~') 

x5: vertical speed (ms~') 


The three outputs are just the first three states, which are to be controlled. 
The matrices A, B, C and D of the state-space model are: 
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= 


0 0 1.1320 0 —1.000 

0 ~—0.0538 —0.1712 0 0.0705 
A=| 0 0 0 1.0000 0 

0 0.0485 0 —0.8556  —1.013 

0 —0.2909 0 1.0532 —0.6859 | 


0 0 0 

—0.120 1.0000 0 

B= 0 0 0 
4.4190 0 —1.665 

1.5750 0 —0.0732 


A.2 Turbo-generator model TGEN 
This is a two-input, two-output, six-state model of a large turbo-alternator. 
The inputs are: 

ul: throttle-valve position 


u2: excitation control 
The outputs are: 


yl: generator terminal voltage 
y2: generator load angle 
The original model is described by Limebeer et al. (1979). A linearized, 


ten-state model is given by Hung and MacFarlane (1982). A multivariable 
controller for this model designed by the reversed-frame normalization 
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method is given by Hung and MacFarlane (1982), one designed by the 
characteristic-locus method is given by Limebeer and Maciejowski (1985), 
and one designed by the Nyquist-array method is given by Boyle et al. (1989). 
The model is also used in the on-line tutorial examples provided with the 
Multivariable Frequency-Domain Toolbox software (Ford et al., 1988). 

The model we present here is a six-state approximation to that ten- 
state model. It is open-loop stable, but has a resonance at 6.35 rads” +, witha 
damping factor of only 0.05. 

The matrices of the state-space model are: 


— 18.4456 4.2263 —2.2830 0.2260 0.4220 —0.0951 
—4.0977 —6.0706 5.6825 —0.6966 — 1.2246 0.2873 

1.4449 1.4336 —2.6477 0.6092 0.8979 — 0.2300 
—0.0093 0.2302 —0.5002 —0.1764 —6.3152 0.1350 
—0.0464 —0.3489 0.7238 6.3117 —0.6886 0.3645 
—0.0602 —0.2361 0.2300 0.0915 —0.3214 —0.2087 


0.2748 3.1463 
—0.0501 —9.3737 
—0.1550 7.4296 
B=| o0716 —49176 
—0.0814 — 10.2648 
0.0244 13.7943 
abot —0.7697 4.8850 4.8608 —9.8177 ae 
3.1013 9.3422 —5.6000 —0.7490 2.9974 10.5719 
0 
Go) 


A.3 Remotely piloted vehicle model RPV 


The following model of a remotely piloted vehicle is given by Safonov et al. 
(1981), who use it to illustrate LQG design; Safonov and Chiang (1988) use it 
to illustrate H,, design. It has two inputs, two outputs and six states. It is 
unstable, its poles being located at — 30, —30, — 5.67, —0,.26, + 0.69 + 0.25). 
The matrices of the state-space model are: l 
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— 0.02567 —36.617 —18.897 —32.090 3.2509 —0.76257 
9.257x 1075 —1.8997 0.98312 —7.256x10~* —0.1708 —4.965 x 107? 
0.012338 11.720 —2.6316 8.758 x 1074 —31.604 22.396 


A= 
0 0 1 0 0 0 
0 0 0 0 = 30 0 
0 0 0 0 0 —30 | 
0 0] 
0 O 
0 0 
B= 
0 0 
30 0 
0 i | 
p 1000 A 
C= 
000100 
a 
D= 
0 0 
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This is the first text to give a comprehensive and unified view of modern 
multivariable feedback theory and design. Balancing techniques with 
theory, the objective throughout is to enable the feedback engineer to 
design real systems. 


To provide a background to the field, the text begins with a review of 
‘classical’ feedback design and shows how assessments of stability, 
performance and robustness can be performed in the multivariable case. 
Coverage is then given to the major multivariable design techniques: 
frequency-domain multivariable design, the QFT method of Horowitz, 
LQG methods, the new and topical ‘H„’ methods and design by parameter 
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